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In the claims : 

Please replace the following claims: 

4. (Amended) A DNA encoding the protein or peptide of claim 1 . 

7. (Amended) A method of producing a protein or peptide comprising the steps 
of: cultivating the transformant of claim 6, and recovering the protein or peptide 
expressed therein. 

8. (Amended) A method of screening for a compound that is capable of binding 
to the protein of claim 1, comprising the steps of: 

(a) exposing a test sample to the protein or peptide of claim 1 , and 

(b) selecting the compound that binds to the protein or peptide of claim 1 . 
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9. (Amended) A method of screening for a ligand and/or agonist that is capable 
of binding to the protein of claim 1 , comprising the steps of: 

(a) exposing a test sample to a cell expressing the protein or peptide of any of 
claims 1 to 3 on its surface, 

(b) measuring a biochemical change in said cell, and 

(c) selecting the compound that induces said biochemical change in said cell. 

1 1 . (Amended) A method of detecting or measuring a protein or peptide 
comprising the steps of: exposing the antibody of claim 10 to a sample which is assumed 
to comprise said protein or peptide, and detecting or measuring the generation of an 
immune complex between said antibody and said protein or peptide. 
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Version with markings to show changes made 

In the claims : 

4. (Amended) A DNA encoding the protein or peptide of [any of claims 1 to 3] 
claim 1. 



7. (Amended) A method of producing [the] a protein or peptide [of any of 
claims 1 to 3,] comprising the steps of: cultivating the transformant of claim 6, and 
recovering the protein or peptide expressed therein. 

8. (Amended) A method of screening for a compound that is capable of binding 
to the protein of claim 1 , comprising the steps of: 

(a) exposing a test sample to the protein or peptide of [any of claims 1 to 3] 
claim 1 , and 

(b) selecting the compound that binds to the protein or peptide of [any of claims 1 
to 3] claim 1 . 

9. (Amended) A method of screening for a ligand and/or agonist that is capable 
of binding to the protein of claim 1, comprising the steps of: 

(a) exposing a test sample to a cell expressing the protein or peptide of [any of 
claims 1 to 3] claim 1 on its surface, 

(b) measuring a biochemical change in said cell, and 

(c) selecting the compound that induces said biochemical change in said cell. 

1 1 . (Amended) A method of detecting or measuring [the] a protein or peptide 
[of any of claims 1 to 3,] comprising the steps of: exposing the antibody of claim 10 to a 
sample which is assumed to comprise said protein or peptide, and detecting or measuring 
the generation of an immune complex between said antibody and said protein or peptide. 
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DESCRIPTION 



NOVEL G PROTEIN-COUPLED RECEPTORS 



Technical Field 

The present invention relates to novel G protein-coupled receptor 
proteins and their genes, as well as methods for producing and using 
same. 



10 Background Art 

The olfactory receptor (OR) gene family has been known as a 
multiple gene family encoding odorant receptors that govern the system 
for olfactory response in the living body. A member of the gene family 
was first reported to be expressed in rat olfactory neurons in 1991 

15 (Buck L. et al. Cell 65:175-187 (1991)), and subsequently, a number 
of homologous genes were found across species, in canine (Permentier 
M. et al. Nature 355:453-455 (1992) ) , mouse (Ressler K.J. et al . Cell 
73:597-609 (1993) ) , human (SelbieL.A. etal. Mol. Brain Res. 13:159-163 
(1992)), catfish (Ngai J. et al. Cell 72:657-666 (1993)), and frog 

20 (FreitagJ. etal. Neuron 15 : 1383-1392 (1995) ) . The receptor proteins 
encoded by the genes of the family have a unique seven transmembrane 
structure that is similar to other G protein-coupled receptor 
subfamilies, and their first extracellular domain contains a common 
[N-x-S/T] motif (x: any amino acid) which is.modif ied by sugar. Their 

25 second and third extracellular domains , loop- 1 and loop-2 , respectively , 
are cross-linked by a disulfide bond between the conserved cysteine 
residues of each. The [M-A-Y-D-R-Y-L/V-A-I/V-C ] sequence within the 
second intracellular domain is another common motif predicted to be 
functionally important. Especially, the [D-R-Y] motif is suggested 

30 to be essential for the binding to intracellular G proteins (Rosenthal 
W. et al. J. Biol. Chem. 268:13030-13033 (1993); Marchese A. et al. 
Genomics 23:609-618 (1994)). 

Although it is predicted that there exist ligands for the odorant 
receptor family , such as odour ligands and neuropeptide-like molecules , 

35 the exact nature has remained obscure (Buck L. et al. Cell 65:175-187 
(1991) ; Ngai J. et al. Cell 72:657-666 (1993) ) . Little is known about 
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their signal transduction except for coupling with G proteins, and 
thus functional analysis of the gene family is desired. Recently, 
it has been proposed to classify the entire gene family into eight 
subfamilies according to structural similarity. However, it is 
5 unlikely that the classification itself can reflect functional 
similarity among the genes . Inhuman, it is assumed that approximately 
one thousand copies of this gene family exists . However, about 70% 
of the copies are assumed to be pseudo genes (Sylvie R. et al. Nature 
Gen. 18:243-250 (1998)). 

10 

Disclosure of the Invention 

The present invention provides novel G protein-coupled receptor 
proteins and their genes , as well as materials and methods for producing 
them, molecules used for the method and such, and their use. 

15 The present inventors performed BLAST search of the GenBank 

public database and found multiple human genome sequences that encode 
novel seven transmembrane G protein-coupled receptor genes. The 
inventors carried out RT-PCR using oligonucleotide primers that were 
designed based on the above genome sequences and mRNA prepared from 

20 human tissues as a template, and obtained partial cDNA sequences of 
the target genes . This RT-PCR was also used to evaluate the expression 
pattern and profile of the genes in those tissues . Then , the inventors 
carried out 5' - and 3'- RACE using a cDNA library prepared from human 
testis as a template, in which the genes were highly expressed based 

25 on the result of RT-PCR, and succeeded in isolating the full length 
cDNA. The first three genes isolated, originating from human 
chromosome 14, are designated as GTAR14-1, GTAR14-3, and GTAR14-5, 
and the next four genes isolated, originating from human chromosome 
11, are designated as GTAR11-1, GTAR11-2, GTAR11-3, and GTAR11-4. 

30 These genes encode novel seven transmembrane G protein-coupled 

receptors, and fulfill the characteristics of the known olfactory 
receptor family genes. The result of the analysis of gene expression 
in human tissues showed that GTAR14-1 was detected strongly in 
lymphatic /hematopoietic tissues, including thymus and spleen, testis, 

35 and pancreas, and was also expressed in fetal thymus. GTAR14-3 was 
detected strongly in thymus and testis, but was also expressed widely 
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in multiple tissues, including digestive tissues such as small 
intestine and colon, and hormone secreting tissues such as placenta 
and prostate. GTAR14-5 was detected strongly in 

lymphatic/hematopoietic tissues, such as thymus, spleen, and 
5 peripheral leukocytes, and testis. Its expression was intensified 
in fetal thymus, fetal spleen, and fetal lung, and was detected weakly 
in fetal brain and fetal kidney. GTAR11-1 was detected strongly in 
lymphatic/hematopoietic tissues including thymus, spleen, and 
peripheral lymphocytes, digestive tissues such as small intestine 

10 and colon, and testis and placenta. GTAR11-2 was detected strongly 
in lymphatic/hematopoietic tissues including thymus and peripheral 
lymphocytes , and testis and placenta . GTAR1 1-3 was detected strongly 
in lymphatic/hematopoietic tissues including thymus and spleen, and 
genital organs, and was also detected broadly in multiple tissues 

15 including digestive tissues such as small intestine and colon, muscle, 
and neuronal tissues. GTAR11-4 was detected strongly in 

lymphatic/hematopoietic tissues including thymus and spleen, and 
genital organs , and was also extensively expressed in multiple tissues 
including digestive tissues such as small intestine and colon, and 

20 endocrine tissues. 

These expression profiles suggest that proteins encoded by those 
genes have an important function in lymphatic/hematopoietic and 
endocrine tissues . Consequently, it was considered that the proteins 
can be used for screening a novel hematopoietic peptide factor or 

25 a novel hormone-like peptide factor. 

The present invention relates to novel G protein-coupled receptor 
proteins and their genes, and their production and use as well. More 
specifically, it provides: 

( 1 ) a G protein-coupled receptor protein selected from the 

30 group consisting of: 

(a) a protein having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 4, 5, 6, 28, 29, 30, and 31; 

( b ) a protein of ( a ) , wherein one or more amino acids are modified 
by deletion, addition, insertion, and/or substitution by another amino 

35 acid residue; and 

(c) a protein encoded by DNA that hybridizes with DNA having 
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a nucleotide sequence selected from the group consisting of SEQ ID 
NO: 1, 2, 3, 24, 25, 26, and 27; 

( 2 ) a fusion protein comprising the protein of ( 1 ) and another 
peptide or polypeptide; 
5 (3) a peptide comprising a part of the protein of (1); 

(4) a DNA encoding the protein or peptide of any of (1) to 

(3); 

(5) a vector comprising the DNA of (4) inserted therein; 

(6) a transformant carrying the DNA of (4) in an expressible 
10 manner ; 

(7) a method of producing the protein or peptide of any of 
(1) to (3), comprising the steps of: cultivating the transformant 
of (6), and recovering the protein or peptide expressed therein; 

(8) a method of screening for a compound that is capable of 
15 binding to the protein of (1), comprising the steps of: 

(a) exposing a test sample to the protein or peptide of any of 
( 1 ) to ( 3 ) ; and 

(b) selecting the compound that binds to the protein or peptide 
of any of (1) to (3) ; 

20 (9) a method of screening for a ligand and/or agonist that 

is capable of binding to the protein of (1), comprising the steps 
of: 

(a) exposing a test sample to a cell expressing the protein or 
peptide of any of (1) to (3) on its surface; 
25 (b) measuring a biochemical change in said cell; and 

(c) selecting the compound that induces said biochemical change 
in said cell; 

(10) an antibody that binds to the protein of (1); 

( 11 ) a method of detecting or measuring the protein or peptide 
30 of any of (1) to (3), comprising the steps of: exposing the antibody 

of ( 10 ) to a sample which is assumed to comprise said protein or peptide, 
and detecting or measuring the generation of an immune complex between 
said antibody and said protein or peptide; and 

(12) A DNA of a length of 15 nucleotides or longer that 
35 hybridizes with a DNA having a nucleotide sequence selected from the 

group consisting of SEQ ID NO: 1, 2, 3, 24, 25, 26, 27, and their 



complementary strands . 

The nucleotide sequences of GTAR14-1, GTAR14-3, and GTAR14-5 
cDNAs that were isolated by the present inventors, and the amino acid 
sequences of the proteins encoded by the cDNAs are shown in SEQ ID 
NO: 1, 2, 3, 4, 5, and 6, respectively. The nucleotide sequences 
of GTAR11-1, GTAR11-2, GTAR11-3, and GTAR11-4 cDNAs that were also 
isolated by the present inventors, and the amino acid sequences of 
the proteins encoded by the cDNAs are shown in SEQ ID NOs: 24, 25, 
26, 27, 28, 29, 30, and 31, respectively. 

The receptor proteins of the OR gene family have a seven 
transmembrane type structure like other G protein-coupled receptors, 
and their first extracellular domains have a [N-x-S/T] motif (x: any 
amino acid) in common that is modified by sugar. There are conserved 
cysteine residues in the second (loop-1) and third (loop-2) 
extracellular domains that form a disulfide bond for cross-linking. 
Furthermore, there is a [M-A-Y-D-R-Y-L/V-A-I/V-C ] sequence 
(especially [D-R-Y] motif) in the second intracellular domain, which 
is suggested to be essential for binding to the G protein (Rosenthal 
W. et al. J. Biol. Chem. 268:13030-13033 (1993); Marchese A. et al. 
Genomics 23:609-618 (1994)). 

The seven proteins encoded by the seven genes isolated by the 
present inventors (designated as GTAR proteins below) are cell membrane 
proteins and are seven transmembrane receptors that contain seven 
hydrophobic transmembrane domains. 

GTAR proteins, like known odorant receptors, have the above 
described motifs characteristic to G protein-coupled receptors . The 
[N-x-S/T] motif (x: any amino acid) modified by sugar is found within 
the first extracellular domain as [N-Q-T], [NttS-T], and [N-T-S] in 
GTAR14-1, GTAR14-3, and GTAR14-5 proteins, respectively, as [N-Y-S] 
in GTAR1 1-1 protein, and as [N-S-T] in GTARl 1-2 , GTARll-3, and GTARl 1-4 
proteins . 

GTAR proteins have conserved cysteine residues in the second 
and third extracellular domains (loop-1 and loop-2, respectively). 
Furthermore, they have the [M-A-Y-D-R-Y-L/V-A-I/V-C] sequence in the 
second intracellular domain, which is suggested to be essential for 
binding to G protein, as well: [ V-A-Y-D-R-Y-V-A-I-C ] in GTAR14-1 
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protein, and [M-A-Y-D-R-Y-L-A-I-C] in all the others. 

These features suggest that GTAR proteins of the present 
invention, like other G protein-coupled receptor proteins, function 
through binding to intracellular G proteins in response to the stimuli 
5 of extracellular ligands. 

Furthermore, the result of RT-PCR analysis revealed a strong 
expression of GTAR genes in thymus and testis (See, for example, Example 
1(3) and 2(3), Figures 7 to 9, and Figures 20 to 23). This suggests 
that the GTAR proteins of the present invention may regulate the 

10 differentiation, or proliferation and activation of a particular group 
of lymphatic cells. Thus, GTAR proteins could function as a central 
molecule that controls the activities characteristic to lymphocytes 
through interaction with G proteins in response to ligand stimuli. 
As these proteins are expressed in testis, where cells are changing 

15 continuously in the cell cycle at a high turn over rate, it is possible 
that they regulate cell cycle and cell division. 

On the other hand, the expression of GTAR14-1 gene was detected 
in spleen, where hematopoietic cells are expected to exist. The 
GTAR14-5 gene was also expressed in spleen and peripheral leukocytes, 

20 where hematopoietic cells are thought to be contained. The GTARll 
genes were expressed in peripheral leukocytes, in which hematopoietic 
cells are considered to be included, and furthermore the genes for 
GTAR11-1, GTARll-3, and GTAR11-4 were expressed in spleen, which is 
expected to have hematopoietic cells. These suggest that GTAR 

25 proteins, particularly GTAR14-1 , GTAR14-5, and GTARll proteins, which 
have structural similarities to known odorant receptors, could 
function as a receptor for a novel hematopoietic regulator. 

The GTAR14-3 gene was expressed in thymus and testis as described 
above, and also in digestive tissues, including small intestine and 

30 colon, and endocrine organs , such as placenta and prostate. Moreover, 
the result of RT-PCR using mRNA prepared from human fetal tissues 
showed a broad distribution pattern of GTAR14-3 gene expression, with 
strong expression in thymus, brain, heart, smooth muscle, and kidney, 
and weak expression in fetal liver and fetal lung. This not only 

35 suggests that GTAR14-3 protein can function as a receptor of a novel 
hematopoietic factor, but also strongly suggests that the protein 
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could regulate diverse physiological functions as a receptor of a 
novel hormone-like peptide molecule. 

The GTAR proteins of the present invention, which can play a 
central role in the function of immune responsive cells, may find 
5 utility in clinical treatment of the following field of diseases. 
First, it is possible to enhance the cellular immune system of 
the living body by promoting the activity of GTAR proteins through 
in vivo administration of ( 1 ) a ligand or agonist derived from the 
living body that is capable of functionally binding to GTAR proteins, 

10 or (2) a specific antibody that can activate the function of GTAR 
proteins. Such substances that are capable of functionally binding 
to GTAR proteins can be clinically applied as a growth promoting factor , 
a differentiation inducing factor, or an activator of immune cell 
functions in immune responsive cells , especially in the group of T-cells . 

15 This means that they can promote in viva immune response, including 
anti tumor immunity and anti infection immunity. More specifically, 
it is possible to enhance cytotoxic immunity toward a particular type 
of tumor tissue, and also increase resistance against a variety of 
viruses, including but not limited to HCV and HIV, by enhancing anti 

20 infection immunity in the living body. 

Next, it is possible to suppress the cellular immunity in the 

living body by promoting the inhibition of the activity of GTAR proteins 

« 

through in vivo administration of ( 1 ) an antagonist or inhibitor that 
is capable of functionally binding to GTAR proteins, or (2) a specific 

25 antibody that can inhibit the function of GTAR proteins . Such 
substances that are capable of functionally binding to GTAR proteins 
can be clinically applied as a growth inhibiting factor, a 
differentiation inhibiting factor, or a suppressor of immune cell 
functions of immune responsive cells , especially of the group of T-cells . 

30 Thus, they can suppress autoimmune diseases resulting from self 
damaging, and rejection reactions accompanying tissue transplantation , 
a most critical issue in transplantation immunity . More specifically , 
they are expected to be very effective for diseases including, but 
not limited to, diabetes, liver damage, collagen arthritis, GVHD, 

35 and EAE, all of which are induced by abnormal activation of immune 
response. Immunosuppression by the inhibitors could also be 
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effective for a variety of antigen specific allergies, such as metal 
allergy and pollen allergy . 

The present invention further provides proteins that are 
functionally equivalent to the GTAR proteins of the present invention. 
5 Herein, "functionally equivalent" means that a protein has an 
equivalent biological activity to that of GTAR proteins. An 
illustrative biological activity is the ability of GTAR to act as 
a G protein-coupled receptor protein, which is an activity to interact 
with intracellular G proteins in response to ligand stimuli and 

10 transduce signals from outside to inside of cells. 

A functionally equivalent protein may be obtained by introducing 
a mutation into the amino acid sequence of the protein. For instance, 
site-directed mutagenesis using synthetic oligonucleotide primers 
may be used to introduce a desired mutation (Kramer W. and Fritz H.J. 

15 Methods Enzymol. 154:350-367 (1987)). PCR mediated site-directed 
mutagenesis system (GIBCO-BRL) may also be used for introducing 
mutations into amino acid sequences of proteins. By using these 
methods, it is possible to obtain a functionally equivalent protein 
to GTAR proteins having an amino acid sequence of a GTAR protein (i.e. , 

20 SEQ ID NO: 4 to 6 , and SEQ ID NO: 2 8 to 31) in which one or more amino 
acids are deleted, added, and/or substituted by other amino acid 
residues without affecting their biological activities. 

More specifically, a functionally equivalent protein to GTAR 
proteins may have a sequence in which one or more, preferably two 

25 or more but not more than 30, more preferably two or more but not 
more than 10 amino acid residues are deleted from the amino acid sequence 
of SEQ ID NOs : 4 to 6 or SEQ ID NOs : 2 8 to 3 1 . A functionally equivalent 
protein to a GTAR protein of the present invention may have a sequence 
in which one or more, preferably two or more but not more than 30, 

30 more preferably two or more but not more than 10 amino acid residues 
are added to the amino acid sequence of SEQ ID NOs: 4 to 6 or SEQ 
ID NOs: 2 8 to 31 . A functionally equivalent protein to a GTAR protein 
of the present invention may have a sequence in which one or more, 
preferably two or more but not morfe than 30, more preferably two or 

35 more but not more than 10 amino acid residues are substituted by other 
amino acids in the amino acid sequence of SEQ ID NOs: 4 to 6 or SEQ 
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ID NOs: 28 to 31. 

It is well known that a protein having deletion , addition , and/or 
substitution of one or more amino acid residues in the sequence of 
a protein can retain the original biological activity (Mark D.F. et 
5 al. Proc. Natl. Acad. Sci. U.S.A. 81:5662-5666 (1984); Zoller M.J. 
and Smith M. Nucleic Acids Res. 10:6487-6500 (1982); Wang A. et al. 
Science 224 : 1431-1433; Dalbadie-McFarland G. et al. Proc. Natl. Acad. 
Sci. U.S.A. 79:6409-6413 (1982)). 

For. instance, a protein in which one or more amino acid residues 

10 are added to GTAR proteins may be a fusion protein comprising GTAR 
proteins. The present invention provides for a fusion protein, 
wherein one or more GTAR proteins and one or more of another protein 
or peptide are fused. Conventional methods may be used to generate 
a fusion protein of the present invention. For example, a DNA encoding 

15 the GTAR protein and a DNA encoding the other protein or peptide are 
ligated so as to match their ORFs in the same frame, and introduced 
into an expression vector. The resulting fusion protein is then 
expressed in a host cell. The protein or peptide to be fused to the 
GTAR protein of the present invention is not limited to any specific 

20 protein or peptide. 

For instance, known peptides including FLAG peptide (Hopp T.P. 
et al. BioTechnology 6:1204-1210 (1988)), 6xHis that is made up of 
six histidine residues, lOxHis, influenza hemagglutinin (HA), 'human 
c-myc fragment, VSV-GP fragment, pl8HIV fragment, T7-tag, HSV-tag, 

25 E-tag, SV40 T antigen fragment, lck tag, a- tubulin fragment, B-tag, 
and Protein C fragment may be used for fusion. Also, 
glutathione-S-transf erase (GST), influenza hemagglutinin (HA), the 
constant region of immunoglobulin, p-galactosidase, maltose binding 
protein (MBP), and the like may be used as a protein to be fused. 

30 DNA encoding these proteins are known in the art and/or commercially 
available . 

The present invention also provides proteins encoded by DNA that 
hybridizes with DNA having a nucleotide sequence of SEQ ID NOs: 1 
to 3 or 24 to 27 under stringent conditions, and is functionally 
35 equivalent to GTAR proteins . Stringent conditions may be 

appropriately chosen by a skilled artisan, and include low stringent 
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conditions. Low stringent conditions may be, for example, at 42°C 
in 2x SSC and 0.1% SDS, preferably at 50°C in 2x SSC and 0.1% SDS. 
More preferably, high stringent conditions such as at 65 °C in 2x SSC 
and 0.1% SDS may be chosen. DNA with higher homology can be obtained 
5 at higher temperature under these conditions. 

Furthermore, the present invention provides proteins that are 
functionally equivalent to GTAR proteins and homologous to the amino 
acid sequence of the proteins (SEQ ID NOs: 4 to 6 and 2 8 to 31). In 
the context of the present invention, a protein is "homologous" 

10 provided it has at least 70% homology, preferably 80% or higher, more 
preferably 9 0 % or higher , and even more preferably 9 5 % or higher homology 
to an amino acid sequence of SEQ ID NOs: 4 to 6 or 2 8 to 31. The 
degree of homology between two proteins can be determined by using 
an algorithm described in the literature (Wilbur W.J. and Lipman D.J. 

15 Proc. Natl. Acad. Sci. U.S.A. 80:726-730 (1983)). 

A protein of the present invention may have variations in amino 
acid sequence, molecular weight, isoelectric point, the presence or 
absence of sugar chains, or form, depending on the cell or host used 
to produce it or the purification method utilized . Nevertheless, 

20 as long as it has an activity as a G protein-coupled receptor, it 
is within the scope of the present invention. 

A protein of this invention may be produced by inserting the 
obtained DNA into an expression vector and expressing under the 
regulation of an expression regulating region, such as an enhancer 

25 or promoter. The expression vector is used to transform a host cell, 
where the protein is expressed. 

More specifically, for expression in mammalian host cells, one 
may construct a DNA or a vector in which a standard useful 
promoter /enhancer , a DNA encoding a protein of the invention, and 

30 a polyA signal at the 3 ' downstream are functionally integrated. For 
instance, human cytomegalovirus immediate early promoter /enhancer 
may be used as a promoter /enhancer . 

Also, virus promoter/enhancers such as those of retrovirus, 
polyomavirus , adenovirus, and simian virus 40 (SV40), or mammalian 

35 promoter /enhancers such as that of human elongation factor la (HEFla) 
may be used as a promoter /enhancer for protein expression. 



For instance, the SV4 0 promoter /enhancer may be readily used 
according to the method of Mulligan et al. (Nature 277:108 (1979)), 
and HEFla promoter/enhancer may be readily used following the method 
of Mizushima et al. (Nucleic Acids Res. 18:5322 (1990)). 

For expression in E . coli, a DNA may be constructed in which 
a standard useful promoter , a signal sequence for polypeptide secretion , 
and the gene to be expressed are functionally integrated . The promoter 
may be, for example, the LacZ promoter, the araB promoter, and the 
like. The LacZ promoter may be used according to the method of Ward 
et al. (Nature 341:544-546 (1998) ; FASEB J. 6:2422-2427 (1992) ) . The 
araB promoter may be used following the method of Better et al. ( Science 
240:1041-1043 (1988)). 

For producing the protein in the periplasm of E. coli, the pelB 
signal sequence (Lei S.P. et al. J. Bacteriol. 169:4379 (1987)) may 
be used as a signal sequence. 

The origin used for replication may be those of SV4 0 , polyomavirus , 
adenovirus, bovine papilloma virus (BPV) , and the like. In addition, 
the expression vector may include a selection marker gene for 
amplification of the gene copies in host cells. Examples of such 
markers include, but are not limited to, the aminoglycoside transferase 
(APH) gene, the thymidine kinase (TK) gene, the£. coli xanthine-guanine 
phosphoribosyl transferase (Ecogpt) gene, and the dihydrof olate 
reductase (dhfr) gene.. 

The expression vector for production of the protein of the 
invention is not limited to any specific one as long as it is suitably 
used for the purpose. For example, the expression vector may be a 
mammal -derived expression vector (e.g., pEF and pCDM8 ) , an insect 
cell-derived expression vector (e.g., pBacPAK8), a plant-derived 
expression vector (e.g., pMHl and pMH2 ) , an animal virus-derived 
expression vector (e.g., pHSV, pMV, and pAdexLcw) , a 
retrovirus-derived expression vector (e.g., pZIpneo), an 
yeast-derived expression vector (e.g. , pNVll and SP-Q01) , a Bacillus 
sujbtilis-derived expression vector (e.g., pPL608 and pKTHSO), or an 
E. coli-derived expression vector (e.g., pQE, pGEAPP, pGEMEAPP, and 
pMALp2 ) . 

The vector of the present invention not only may be used for 
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producing the protein of the invention in vivo or in vitro, but also 
may be used for gene therapy in mammals including humans. 

Known methods, such as calcium phosphate method (Virology 
52:456-467 (1973)) and electroporation (EMBO J. 1:841-845 (1982)), 
5 may be used to introduce the expression vector constructed above into 
a host cells. 

Any production system can be used herein for producing proteins . 

The present invention provides methods of producing a protein of the 

invention both in vitro or in vivo. 
10 For in vitro production, eukaryotic cells or prokaryotic cells 

can be used. Useful eukaryotic cells may be animal, plant, or fungi 

cells. As animal cells, mammalian cells such as CHO (J. Exp. Med. 

108:945 (1995)), COS, myeloma, baby hamster kidney (BHK) , HeLa, or 

Vero cells, amphibian cells such as Xenopus oocytes (Valle et al. 
15 Nature 291:340-358 (1981)), or insect cells such as sf9, sf21, or 

Tn5 cells can be used. CHO cells lacking DHFR gene (dhfr-CHO) (Proc. 

Natl. Acad. Sci. U.S.A. 77:4216-4220 (1980)) or CHO K-l (Proc. Natl. 

Acad. Sci. U.S.A. 60:1275 (1968)) may also be used. 

As plant cells, plant cells originating from Nicotiana tabacum 
20 are known and may be used as callus cultures. As fungi cells, yeast 

cells such as Saccharomyces , including Saccharomyces cerevisiae , or 

filamentous fungi such as Aspergillus , including Aspergillus niger, 

are known and may be used herein. 

Useful prokaryotic cells include bacterial cells, such as E. 
25 coli or Bacillus subtilis . 

These cells are transformed by a desired DNA, and the resulting 

trans formants are cultured in vitro to obtain the protein. 

Transf ormants can be cultured using known methods. Culture medium 

such as DMEM, MEM, RPMI164 0, or IMDM may be used with or without serum 
30 supplement such as fetal calf serum (FCS). The pH of the culture 

medium is preferably between about 6 and 8. Such cells are typically 

cultured at about 30 to 40 °C for about 15 to 200 hr, and the culture 

medium may be replaced, aerated, or stirred if necessary. 

Animal and plant hosts may be used for in vivo production. For 
35 example, a desired DNA can be introduced into an animal or plant host. 

Encoded proteins are produced in vivo, and then recovered. These 
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animal and plant hosts are included in host cells of the present 
invention. 

Animals to be used for the production system described above 
include, but are not limited to, mammals and insects. Mammals such 
5 as goat, porcine, sheep, mouse, and bovine may be used (Vicki Glaser, 
SPECTRUM Biotechnology Applications (1993)). Alternatively, the 
mammals may be transgenic animals. 

For instance, a desired DNA may be prepared as a fusion gene 
by inserting it into the middle of a gene, such as goat p casein gene 
10 which encodes a protein specifically produced into milk. DNA 
fragments comprising the fusion gene having the desired DNA are injected 
into goat embryos , which are then introduced back to female goats . 
Proteins are recovered from milk produced by the transgenic goats 
(i.e. , those born from the goats that had received the modified embryos ) 
15 or from their offspring. To increase the amount of milk containing 
the proteins produced by transgenic goats, appropriate hormones may 
be administered to them (Ebert K.M. et al. Bio/Technology 12:699-702 
(1994)). 

Alternatively, insects, such as the silkworm, may be used. A 

20 desired DNA inserted into baculovirus can be used to infect silkworms, 
and the desired protein is recovered from their body fluid (Susumu 
M . et al. Nature 315:592-594 (1985)). 

As plants, for example, tobacco can be used. In use of tobacco, 
a desired DNA may be inserted into a plant expression vector, such 

25 as pMON530, which is introduced into a bacteria, such as Agrobacterium 
tumefaciens. Then the bacteria is used to infect tobacco, such as 
Nicotlana tabacum , and a desired polypeptide is recovered from their 
leaves (Julian K.-C. Ma et al. Eur. J. Immunol. 24:131-138 (1994)). 
A protein of the present invention obtained as above may be 

30 isolated from inside or outside of cells or hosts, and purified as 
a substantially pure homogeneous protein. The method for protein 
isolation and purification is not limited to any specific method; 
in fact , any standard method may be used . For instance , column 
chromatography, filter, ultrafiltration, salt precipitation, solvent 

35 precipitation, solvent extraction, distillation, 

immunoprecipitation, SDS-polyacrylamide gel electrophoresis , 
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isoelectric point electrophoresis, dialysis, and recrystallization 
may be appropriately selected and combined to isolate and purify the 
protein . 

For chromatography, for example, affinity chromatography, 
5 ion-exchange chromatography, hydrophobic chromatography, gel 
filtration, reverse phase chromatography, adsorption chromatography , 
and such may be used (Strategies for Protein Purification and 
Characterization: A Laboratory Course Manual. Ed. Daniel R. Marshak 
et al., Cold Spring Harbor Laboratory Press (1996)). These 
10 chromatographies may be performed by liquid chromatography such as 
HPLC and FPLC. Thus, the present invention provides for highly 
purified proteins, produced by the above methods . 

A protein of the present invention may be optionally modified 
or partially deleted by treating it with an appropriate protein 
15 modification enzyme before or after purification. Useful protein 
modification enzymes include, but are not limited to, trypsin, 
chymotrypsin, lysylendopeptidase, protein kinase, and glucosidase. 

The present invention also provides partial peptides of the GTAR 
proteins of the present invention (i.e., SEQ ID NOs: 4 to 6 and 28 
20 to 31). A partial peptide may be, for example, a peptide which is 
one of the partial peptide sequences encoded by GTAR genes, and 
corresponds to the binding domain to an in vivo ligand. Such a partial 
peptide , when administered in vivo , may bind to the ligand competitively , 
and inhibit the binding between the GTAR protein and the ligand. 
25 Similarly, a partial peptide corresponding to the G-protein binding 
domain of a GTAR protein may antagonistically inhibit the binding 
between GTAR protein and intracellular G protein. Such partial 
peptides may be useful as inhibitors of signal transduction pathways 
mediated by GTAR proteins . 
30 A partial peptide of the protein of the invention can be produced 

by genetic engineering, known methods of peptide synthesis, or by 
digesting the protein of the invention with an appropriate peptidase. 
For peptide synthesis, for example, solid phase synthesis or liquid 
phase synthesis may be used. 
35 Furthermore, the present invention relates to a DNA encoding 

the protein of the present invention as described. A cDNA encoding 



the protein may be obtained by, for example, screening a human cDNA 
library using one or more probes described herein. 

A cDNAmay also be obtained from different cells , tissues, organs, 
or species by further screening a cDNA library using the obtained 
cDNA or cDNA fragment as a probe. A cDNA library may be prepared, 
for example, by the method described in literature (Sambrook J. et 
al. Molecular Cloning, Cold Spring Harbor Laboratory Press (1989)), 
or may be commercially available. 

The nucleotide sequence of the obtained cDNA is determined to 
find an open reading frame, and thereby the amino acid sequence of 
the protein of the invention can be obtained. The cDNA obtained may 
also be used as a probe for screening a genomic library to isolate 
a genomic DNA. 

More specifically, mRNAsmay first be prepared fromacell, tissue, 
or organ in which the protein of the invention is expressed. Known 
methods can be used to isolate mRNAs ; for instance , total RNA is prepared 
by guanidine ultracentrif ugation (Chirgwin J.M. et al. Biochemistry 
18:5294-5299 (1979)) or AGPC method (Chomczynski P. and Sacchi N. 
Anal. Biochem. 162:156-159 (1987)), and mRNA is purified from total 
RNA using mRNA Purification Kit (Pharmacia) and such. 
Alternatively, mRNA may be directly purified by QuickPrep mRNA 
Purification Kit (Pharmacia). 

The obtained mRNA is used to synthesize cDNA using reverse 
transcriptase. cDNA may be synthesized by using a kit such as the 
AMV Reverse Transcriptase First-strand cDNA Synthesis Kit (Seikagaku 
Kogyo). Alternatively, cDNA may be synthesized and amplified 
following the 5 ' -RACE method (Frohman M.A. et al. Proc. Natl. Acad. 
Sci. U.S.A. 85:8998-9002 (1988); Belyavsky A. ,et al. Nucleic Acids 
Res. 17:2919-2932 (1989)) which uses a probe described herein, the 
5'-Ampli FINDER RACE Kit (Clontech), and polymerase chain reaction 
(PCR) . 

A desired DNA fragment is prepared from the PCR products and 
ligated with a vector DNA. The recombinant vectors are used to 
transform JET. coll and such, and a desired recombinant vector is prepared 
from a selected colony. The nucleotide sequence of the desired DNA 
is verified by conventional methods, such as dideoxynucleotide chain 
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termination* 

A DNA of the invention may be designed to have a sequence that 
is expressed more efficiently by taking into account the frequency 
of codon usage in the host to be used for expression (Grantham R. 
et al. Nucleic Acids Res. 9:43-74 (1981)). The DNA of the present 
invention may be altered by a commercially available kit or a 
conventional method . For instance , the DNAmay be altered by digestion 
with restriction enzymes, insertion of a synthetic oligonucleotide 
or an appropriate DNA fragment, addition of a linker, or insertion 
of the initiation codon (ATG) and/or the stop codon (TAA, TGA, or 
TAG) . 

The present invention particularly provides DNA having the 
following nucleotide sequences: from A at 9 to C at 94 7 of SEQ ID 
NO: 1, from A at 13 to T at 951 of SEQ ID NO: 2, from A at 410 to 
T at 1339 of SEQ ID NO: 3, from A at 17 to G at 892 of SEQ ID NO: 
24, from A at 18 to C at 956 of SEQ ID NO: 25, from A at 18 to C at 
956 of SEQ ID NO: 26, or from A at 19 to C at 945 of SEQ ID NO: 27. 

Furthermore, the present invention provides DNA that is capable 
of hybridizing with DNA having a nucleotide sequence of SEQ ID NOs : 
1 to 3 or 24 to 27 under stringent conditions, and encoding a protein 
functionally equivalent to the protein of the invention described 
above. Stringent conditions may be appropriately chosen by one 
skilled in the art- For instance, low stringent conditions may be 
42°C in 2xSSC and 0.1% SDS, and preferably 50°C in 2xSSC and 0.1% 
25 SDS . More preferably, high stringent conditions such as 65°C in 2xSSC 
and 0.1% SDS may be chosen. Under the conditions, DNAs having higher 
homologies can be obtained with increasing temperature. The 
hybridizing DNA above is preferably a cDNA or a chromosomal DNA. 

The protein of the invention may be used for screening a compound 
that is capable of binding to the protein, such compound beinguseful 
as a drug, or the like. Specifically, the protein may be used in 
a method of screening the compound, such method comprising the steps 
of exposing the protein of the present invention to a test sample 
in which a compound binding to the protein is expected to be contained, 
and selecting the compound having the activity of binding to the protein . 

The proteins of the invention used for screening may be 



20 



30 



35 
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recombinant or wild type proteins, or partial peptides. 
Alternatively, they may be purified proteins or partial peptides, 
or in the form expressed on the surface of cells or in the form of 
a membrane fraction. 
5 A compound having the activity of binding to the protein of the 

invention may be a natural or artificially synthesized compound. A 
natural compound may be, for example, a ligand binding to the protein 
of the invention. A ligand herein is defined as a natural compound 
that binds to a protein of the invention and induces a specific 
10 physiological reaction in cells expressing the protein. The 
physiological reaction includes, but is not limited to, biochemical 
changes such as production of cAMP, change in the concentration of 
intracellular calcium, or activation of protein kinase C, as described 
below. 

15 Screening for a protein which is capable of binding to the protein 

of the invention may be carried out by, for example, using the 
West-Western blotting method (SkolnikE.Y. et al. Cell 65:83-90 (1991) ) • 
Specifically, cDNAs are isolated from cells, tissues, or organs in 
which the proteins binding to the protein of the present invention 

20 is expected to be expressed, and introduced into a phage vector such 
as Xgtll or ZAPII to construct a cDNA library. The phage vectors 
of the library are spread on plates containing medium, and proteins 
are expressed. The proteins expressed are transferred to a filter 
membrane, which is reacted with a labeled protein of the invention 

25 and then purified. The plaques expressing those proteins that bound 
to the protein of the invention can be identified by detecting the 
label. The protein of the invention may be labeled by a method 
utilizing the binding between biotin and avidin, or a method utilizing 
an antibody that specifically binds to the protein of the present 

30 invention, or a peptide or polypeptide that is fused to the protein 
of the present invention . Methods using radioisotope or fluorescence 
and such may also be used. 

Alternatively, in another embodiment of the method of screening 
of the present invention, a two-hybrid system utilizing cells may 

35 be used (Fields S . and Sternglanz R. Trends Genet . 10:286-292 (1994)). 

The two-hybrid system can be used as follows . A f irstexpression 



vector, comprising a DNA encoding a fusion protein of the present 
invention and a subunit of a transcription factor that forms a 
heterodimer , and a second expression vector, comprising a desired 
cDNA to be tested ligated to a DNA encoding the other subunit of the 
transcription factor, are introduced into cells and induced. A 
reporter gene that was integrated into the cells is expressed only 
if the protein of the invention and the protein encoded by a desired 
cDNA bind to each other and when the transcription factors form a 
heterodimer. Thus, a protein that is capable of binding to a protein 
of the invention can be selected by detecting or measuring the expression 
level of the reporter gene. 

More specifically, a DNA encoding a protein of the invention 
and a gene encoding the DNA binding domain of LexA are ligated in 
frame to construct an expression vector. Next, the desired cDNA and 
a gene encoding the GAL 4 transcription activation domain are ligated 
to obtain another expression vector. 

Cells carrying the HIS3 gene under the expressional regulation 
of a promoter containing a LexA binding motif are transformed with 
the above two expression plasmids of the two-hybrid system, and 
incubated in a synthetic medium lacking histidine. Cells can grow 
only if there is a protein-protein interaction. Thus, the expression 
level of the reporter gene is examined by the growth rate of 
transf ormants . 

Alternatively, instead of HIS3 gene, the reporter gene may be 
a luciferase gene, a plasminogen activator inhibitor typel (PAI-1) 
gene, or else. 

The two-hybrid system may be constructed by a standard method, 
or a commercial kit such as MATCHMAKER Two-Hybrid System, Mammalian 
MATCHMAKER Two-Hybrid Assay Kit (both from Clontech), or HybriZAP 
Two-Hybrid Vector System (Stratagene) may be used. 

In another embodiment, the method for screening of the present 
invention utilizes affinity chromatography. The protein of the 
invention is immobilized on a carrier of an affinity column, and a 
test sample, in which a protein capable of binding to the protein 
of the invention is supposed to be expressed, is applied to the column. 
A test sample herein may be cell culture supernatant, cell extracts, 
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cell lysates , etc . After loading the test sample, the column is washed, 
and proteins bound to the protein of the invention can be obtained. 

Alternatively, a compound that binds to the protein of the 
invention may be screened by stably expressing the protein of the 
5 present invention on the surface of an appropriate host cell, exposing 
the cell to a sample which is assumed to contain a ligand, and detecting 
a biochemical change induced in the cell. The host cell may be one 
of a variety of mammalian cells, but preferably it is a cell which 
naturally express the protein of the invention , for example, the immune 

10 cells. Different biochemical changes may be used as an index, 
depending on the type of G proteins that bind to the protein of the 
invention. Examples include, but are not limited to, cAMP production, 
change in intracellular calcium concentration, and activation of 
protein kinase C. Alternatively, as a more generally used index, 

15 micro extracellular pH change can be measured by a microphysiometer 
(for instance, "Cytosensor R " , Molecular Device Inc.). 

Samples to be tested in the method for screening of the present 
invention include, but are not limited to, peptides, purified or crude 
preparation of proteins, non-peptide compounds, synthetic compounds, 

20 products of microorganism fermentation, cell extracts, animal tissue 
extracts, marine organism extracts, and plant extracts. 

Compounds isolated by the above method for screening may be a 
candidate for a drug that can promote or inhibit the activity of the 
protein of the invention and thus beusef ul in the treatment of a disease 

25 caused by the abnormal function of a receptor protein of the invention. 
That part of the compound having the activity to bind to the protein 
of the present invention obtained by the method for screening of the 
present invention may be structurally changed by addition, deletion, 
and/or substitution . Such compound parts or fragments are also within 

30 the scope of the present invention. 

It is possible to screen an agonist or antagonist of a receptor 
protein of the invention by using as an index the biochemical changes 
described above or the activity inhibiting the binding between the 
receptor protein and its ligand . Herein , an agonist is defined herein 

35 as a compound that is not derived from nature but is capable of binding 
to the protein ( s ) of the invention and inducing a physiological reaction 
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in cells in a manner analogous to that of a ligand. Likewise, an 
antagonist is defined herein as a compound that inhibits the function 
of a ligand or agonist toward the protein(s) of the invention and 
includes both natural and unnatural compounds. 
5 In a method in which an intracellular biochemical change is used 

as an index, for example, the above-described cells stably expressing 
a protein of the invention may be exposed to a test sample in the 
presence of a ligand, and the above-described biochemical index, such 
as cAMP production, change of intracellular calcium concentration, 
10 or activation of protein kinase C, may be measured. If inhibition 
is seen in the biochemical index, the compound contained in the test 
sample is judged to be an antagonist of the protein of the present 
invention. 

Alternatively, for example, the above cells stably expressing 
15 a protein of the invention may be directly exposed to a test sample, 
and a biochemical change as above described may be measured to isolate 
an agonist. 

In a method in which the activity to inhibit the binding between 
a receptor protein of the invention and a ligand is used as an index, 

20 for example, phage display method may be used to isolate an agonist 
or antagonist- Phage display may be performed using, for example, 
the pSKAN Phagemid Display System (Catalogue #OB-15 18-00) according 
to the literature (Nicholas C. et al. Science 273:458 (1996)). 

A compound obtained by the method of screening of the present 

25 invention may be applied as a medicine to humans or others mammals, 
such as mouse, rat, guinea pig, rabbit , chicken, cat, dog, sheep, 
porcine, bovine , monkey, sacred baboon, or chimpanzee, according to 
the standard procedures. 

For instance, the compound may be administered orally, in the 

30 form of tablet, if necessary coated with sugar, capsule, elixir, or 
microcapsule, or may be parenterally administered, in the form of 
sterile solution in water or other pharmaceutically acceptable 
solution, or injection of suspension. A medicine may be prepared 
by, for example, mixing a substance having activity to bind to the 

35 protein of the present invention with a physiologically acceptable 
carrier, flavor, excipient, vehicle, preservative, stabilizer, or 
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binder to a form a unit or dosage that is required by standard 
pharmaceutical procedures. The amount of active ingredient 

contained within the medicine may provide an appropriate dose within 
an indicated range. 
5 Additives, for instance, binders such as gelatin, corn starch, 

tragacanth gum, or Arabia gum, excipients such as crystalline cellulose, 
swelling agents such as corn starch, gelatin, or alginic acid, 
lubricants such as magnesium stearate, sweeteners such as sucrose, 
lactose, or saccharine, or flavors such as peppermint, Gaultheria 
10 adenothrix oil, or cherry may be mixed within the tablet or capsule. 
If the preparative unit form is a capsule, liquid carriers such as 
fatty oils may be included as well . Sterile compositions for injection 
may be prepared by using vehicles such as distilled water for injection 
according to a standard pharmaceutical procedures . 
15 Aqueous solutions such as saline, or isotonic solutions 

containing glucose or other adjuvants such as D-sorbitol, D-mannose, 
D-mannitol, or sodium chloride may be used for injection. To increase 
solubility, alcohols such as ethanol, polyalcohols such as propylene 
glycol, or polyethylene glycol, or non-ionic detergents such as 
20 polysorbate 80™, or HCO-50 may be appropriately added as an adjuvant. 

Sesame oil or soybean oil may be used as oleaginous solution. 
Solubilizing agents such as benzyl-benzoate or benzyl alcohol may 
be added to increase solubility. Buffers such as phosphate buffer 
or sodium acetate buffer, pain-killers such as procaine chloride, 
25 stabilizers such as benzyl alcohol or phenol, or antioxidants may 
be mixed as well. Prepared solutions for injection may be packed 
in an appropriate ampoule. 

The dose of the compound having the activity to bind to the protein 
of the invention may vary depending on symptom, but may be, in general, 
30 approximately 0 . 1 to 100 mg, preferably approximately 1.0 to 50 mg, 
and more preferably approximately 1.0 to 20 mg administered orally 
per day for adult (60 kg body weight). 

In case of parenteral administration, a dose of, for example, 
approximately 0.01 to 30 mg, preferably approximately 0.1 to 20 mg, 
35 and more preferably approximately 0.1 to 10 mg may be injected 
intravenously per day for adult (60 kg body weight), although the 
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dose may vary depending on the recipient, target organ, symptom, or 
method. In case of other animals, a dose normalized by the dose per 
60 kg body weight may be applied. 

An antibody of the present invention may be obtained as a 
5 monoclonal or polyclonal antibody by conventional methods. 

To produce an antibody that specifically binds to the protein 
of the invention, the protein may be used as an antigen to immunize 
animals according to a standard method . Immune cells are thus obtained , 
and fused with known parental cells by standard cell fusion methods. 
10 Fusion cells are then screened for cells producing antibodies using 
standard methods. 

More specifically, a monoclonal or polyclonal antibody that 
specifically binds to the protein of the invention may be generated 
as follows: 

15 For instance, a protein of the invention used as an antigen to 

obtain an antibody may be derived from any animal species , but preferably 
it is from a mammal such as a human, mouse, or rat, or more preferably 
from a human. A human-derived protein may be obtained from the 
nucleotide or amino acid sequences disclosed herein, 

20 Any protein having a biological activity of the proteins 

described herein, or its partial peptide may be used as an antigen 
in the present invention. A partial peptide may be, for example, 
an amino ( N ) -terminal or carboxy ( C ) -terminal fragment of the protein . 
Herein, an antibody is defined as an antibody that specifically reacts 

25 with either the full length or a fragment of the protein. 

A gene encoding a protein of the invention or its fragment may 
be inserted into a known expression vector, which is used to transform 
a host cell as described herein. The desired protein or its fragment 
may be recovered from the outside or inside of host cells, or from 

30 host cells by any standard method, and may be used as an antigen. 
Alternatively, cells expressing the protein or their lysates, or a 
chemically synthesized protein may be used as an antigen. 

Any mammalian animal may be immunized with the antigen, but 
preferably the compatibility with parental cells used for cell fusion 

35 is taken into account. In general, animals of Rodentia, Lagomorpha, 
or Primates are used. 
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Animals of Rodentia include, for example, mouse, rat, and hamster. 
Animals of Lagomorpha include, for example, rabbit. Animals of 
Primates include, for example, amonkey of Catarrhini ( old world monkey ) 
such as Macaca fascicularis , rhesus monkey, sacred baboon, or 
5 chimpanzee. 

Methods for immunizing animals with antigens are known in the 
art. For instance, intraperitoneal injection or subcutaneous 
injection of antigens is used as a standard method for immunization 
of mammals. More specifically, antigens may be diluted and suspended 
10 in an appropriate amount with phosphate buffered saline (PBS), 
physiological saline, etc. If desired, the antigen suspension may 
be mixed with an appropriate amount of a standard adjuvant, such as 
Freund's complete adjuvant, made into emulsion, and then administered 
to mammalian animals. Preferably, it is followed by several 

15 administrations of antigen mixed with an appropriately amount of 
Freund's incomplete adjuvant every 4 to 21 days. An appropriate 
carrier may also be used for immunization. After immunization as 
above , serum is examined for increase of the amount of desired antibodies 
by a standard method. 

20 Polyclonal antibodies to the proteins of the present invention 

may be prepared by collecting blood from the immunized mammal examined 
for the increase of desired antibodies in the serum, and by separating 
serum from the blood by any conventional method. Polyclonal 
antibodies may be used as serum containing the polyclonal antibodies, 

25 or if necessary, a fraction containing the polyclonal antibodies may 
be isolated from the serum. 

To prepare monoclonal antibodies, immune cells are collected 
from the mammal immunized with the antigen and checked for the increased 
level of desired antibodies in the serum as described above, and are 

30 subjected to cell fusion. The immune cells used for cell fusion are 
preferably obtained from spleen . The other parent cell which is fused 
with the above immune cell is preferably a mammalian myeloma cell, 
and more preferably a myeloma cell that has acquired a special feature 
that can be used for selection of fusion cells by drug. 

35 Cell fusion of the above immune cell and myeloma cell may be 

performed by any standard method, such as those described in the 
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literature (Galfre G. andMilstein C . Methods Enzymol. 73 : 3-46 ( 1981 ) ) , 
Resulting hybridomas obtained by the cell fusion may be selected 
by cultivating them in a standard selection medium, such as HAT medium 
(hypoxanthine, aminopterin, and thymidine containing medium) . The 
5 cell culture is typically continued in the HAT medium for several 
days to several weeks, the time being sufficient to allow all the 
other cells , except desired hybridoma ( non-fused cells ) , to die • Then , 
the standard limiting dilution is performed to screen and clone a 
hybridoma cell producing the desired antibody. 

10 Besides the above method, in which anonhuman animal is immunized 

with an antigen for preparing hybridoma, human lymphocytes such as 
that infected by EB virus may be immunized with a protein, protein 
expressing cells, or their lysates in vitro. Then, the immunized 
lymphocytes are fused with human-derived myeloma cells that is capable 

15 of indefinitely dividing, such as U266, to yield a hybridoma producing 
a desired human antibody, able to bind to the protein can be obtained 
(Unexamined Published Japanese Patent Application (JP-A) No, Sho 
63-17688) . 

Furthermore, transgenic animals having a repertory of human 

20 antibody genes may be immunized with a protein, protein expressing 
cells, or their lysates as an antigen. Antibody producing cells are 
collected from the animals, and fused with myeloma cells to obtain 
hybridoma, from which human antibodies against the protein can be 
prepared (see WO92-03918, W093-2227, -WO94-02602, W094-25585, 

25 W096-33735, and WO96-34096) 

Alternatively, an immune cell, such as an immunized lymphocyte, 
producing antibodies may be immortalized by an oncogene and used for 
preparing monoclonal antibodies. 

Monoclonal antibodies thus obtained can be also be recombinantly 

30 prepared using conventional genetic engineering techniques (see, for 
example, Borrebaeck C.A.K, and Larrick J.W. Therapeutic Monoclonal 
Antibodies, published in the United Kingdom by MacMillan Publishers 
LTD (1990) ) • A DNA encoding an antibody may be cloned from an immune 
cell such as a hybridoma or an immunized lymphocyte producing the 

35 antibody, inserted into an appropriate vector, and introduced into 
host cells to prepare a recombinant antibody. The present invention 
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also provides recombinant antibodies prepared as described above. 
An antibody of the present invention may be a fragment of an 
antibody or modified antibody, solong as it binds to one or more of 
the proteins of the invention. For instance, the antibody fragment 
5 may be Fab, F(ab' ) 2f Fv, or single chain Fv (scFv), in which Fv fragments 
from H and L chains are ligated by an appropriate linker (Huston J.S. 
et al. Proc. Natl, Acad. Sci. U.S.A. 85:5879-5883 (1988)). More 
specifically, an antibody fragment may be generated by treating an 
antibody with an enzyme such as papain or pepsin. Alternatively, a 
10 gene encoding the antibody fragment may be constructed, inserted into 
an expression vector, and expressed in an appropriate host cell (see, 
for example, Co M.S. et al. J. Immunol. 152:2968-2976 (1994); Better 
M. and Horwitz A. H . Methods Enzymol. 178:476-496 (1989); Pluckthun 
A. andSkerraA. Methods Enzymol . 178:497-515 (1989) ; Lamoyi E . Methods 
15 Enzymol. 121:652-663 (1986); Rousseaux J. et al . Methods Enzymol. 
121:663-669 (1986); Bird R.E. and Walker B.W. Trends Biotechnol. 
9:132-137 (1991)). 

An antibody may be modified by conjugation with a variety of 
molecules, such as polyethylene glycol (PEG) . The present invention 
20 provides such modified antibodies. The modified antibody can be 
obtained by chemically modifying an antibody. These modification 
methods are conventional in the field. 

Alternatively, an antibody of the present invention may be 
obtained as a chimeric antibody, between a variable region derived 
25 from nonhuman antibody and the constant region derived from human 
antibody, or as a humanized antibody, comprising the complementarity 
determining region (CDR) derived from nonhuman antibody, the frame 
work region (FR) derived from human antibody, and the constant region. 
Such antibodies can be prepared by using known technology. 
30 Obtained antibodies may be purified to homogeneity. The 

invention is not limited to one particular method for antibody 
separation and purification . Any standard method of protein separation 
and purification may be used. The concentration of the obtained 
antibody may be determined by measuring absorbance, by enzyme-linked 
35 immunosorbent assay (ELISA) , etc. 

ELISA, enzyme immunoassay (EIA) , radioimmunoassay (RIA) , and/or 



immunofluorescence may be used to measure the antigen binding activity 
of the antibody of the invention . In ELISA, the antibody of the present 
invention is immobilized on a plate, protein of the invention is applied 
to the plate, and then a sample containing a desired antibody, such 
as culture supernatant of antibody producing cells or purified 
antibodies, is applied. Then, a secondary antibody, that recognizes 
the primary antibody and is labeled with an enzyme such as alkaline 
phosphatase, is applied, and the plate is incubated. After washing , 
an enzyme substrate, such as p-nitrophenyl phosphate, is added to 
the plate, and the absorbance is measured to evaluate the antigen 
binding activity of the sample. A fragment of the protein, such as 
a C-terminal or N-terminal fragment , may be used as a protein . BIAcore 
(Pharmacia) may be used to evaluate the activity of the antibody 
according to the present invention. 

The above methods allow for the detection or measurement of the 
protein of the invention, by exposing the antibody of the invention 
to a sample assumed to contain the protein of the invention, and 
detecting or measuring the immune complex formed by the antibody and 
the protein. 

Because the method of detection or measurement of the protein 
according to the invention can specifically detect or measure a protein, 
the method may be useful in a variety of experiments in which the 
protein is used. 

The present invention also provides DNA of a length of 15 
nucleotides or longer that hybridize with DNA having the nucleotide 
sequence of SEQ ID NOs : 1 to 3 or 24 to 27, or with DNA complementary 
to the DNA (complementary strand). Thus, the present invention 
provides probes that can selectively hybridize with a DNA encoding 
a protein of the invention, a DNA complementary to such a DNA, or 
a nucleotide or its derivative, such as antisense oligonucleotide, 
a ribozyme, etc. 

The present invention comprises, for example, an antisense 
oligonucleotide that hybridizes with any portion of the nucleotide 
sequence of SEQ ID NOs: 1 to 3 and 24 to 27. The antisense 
oligonucleotide is preferably an antisense of a continuous sequence 
of a length of 15 nucleotides or longer within the nucleotide sequence 



of any of SEQ ID NOs : 1 to 3 and 2 4 to 27. More preferably, the above 
continuous sequence of a length of 15 nucleotides or longer contains 
the translation initiation codon. 

A derivative or modified form of antisense oligonucleotide may 
also be used . The latter formmay modified with lower alkylphosphonate 
such as methylphosphonate or ethylphosphonate , or with 
phosphorothioate, or phosphoroamidate . 

Herein, an antisense oligonucleotide is not restricted to one 
in which all nucleotides are complementary to the corresponding 
nucleotides within a given region of a DNA or mRNA; as long as it 
can selectively forma stable hybrid with any of the nucleotide sequences 
of SEQ ID NOs: 1 to 3 or 24 to 27, it may have one or more nucleotide 
mismatches . 

The phrase "selectively form a stable hybrid" is defined herein 
as not forming a significant cross hybrid with a DNA encoding other 
proteins under normal conditions of hybridization , or preferably under 
stringent conditions. Such DNA include those which have a homology 
of at least 70%, preferably 80% or higher, more preferably 90% or 
higher, and even more preferably 95% or higher within a continuous 
sequence of a length of 15 nucleotide or longer. To determine the 
degree of homology, the algorithm described herein may be used. As 
described in the following Examples, the above DNA is useful as a 
probe for detection or isolation of a DNA encoding the protein of 
the invention, or as a primer for its amplification. 

A derivative of an antisense oligonucleotide of a present 
invention may act on cells producing the protein of the invention 
and bind to a DNA or mRNA encoding the protein, and then, it may inhibit 
the expression of the protein of the invention by inhibiting its 
transcription or translation, or by promoting the degradation of mRNA, 
and thereby inhibiting the function of the protein of the invention. 

A derivative of an antisense oligonucleotide of the present 
invention may be mixed with an appropriate base which is inactive 
against the derivative, and used as a medicine for externally 
application such as salve or poultice. 

If necessary, it may be mixed with an excipient, isotonizing 
agent, solubilizing agent, stabilizer, preservative, pain-killer, 



or the like, and prepared as a tablet , powder, granule, capsule, liposome 
capsule, injectable solution, liquid formulation, nose drops, 
freeze-dried agent, etc. The above may be achieved according to 
standard methods . 

For treating patients, a derivative of an antisense 
oligonucleotide of the present invention may be, for example, directly 
applied to the affected area of a patient, or administered into blood 
vessels so as to finally reach the affected area. Moreover, the 
derivative may be encapsulated in antisense-encapsulating materials 
such as liposome, poly-L-lysine, lipid, cholesterol, lipofectin, or 
their derivative in order to increase durability and/or membrane 
permeability . 

Dose of the derivative of the antisense oligonucleotide of the 
present invention may be appropriately adjusted depending on the 
patient's conditions, and a favorable amount such as 0.1 to 100 mg/kg, 
or more preferably 0.1 to 50 mg/kg may be administered. 

As the antisense oligonucleotides of the present invention 
inhibit expression of the protein of the invention, it finds utility 
as an inhibitor of biological activity of the protein of the invention. 
An inhibitor of expression comprising the antisense oligonucleotide 
of the present invention is useful because it can inhibit the biological 
activity of the protein of the invention. 

Brief Description of the Drawings 

Figure 1 shows a comparison of the amino acid sequences among 
known human olfactory receptors and GTAR14-1 and GTAR14-2 . Shadowed 
residues are shared by at least four proteins out of five. Dotted 
residues are matched in all five proteins. Abbreviations are as 
follows : 

0LF1 for human olfactory receptor 1 (GenBank Accession! U56420 ) ; 
OLF2 for human olfactory receptor 2 (GenBank Accession! L35475) ; OLF3 
for human olfactory receptor 3 (GenBank Accession! L56421); 14-1 for 
GTAR14-1; and 14-2 for GTAR14-2 (pseudogene) . 

Figure 2 shows a comparison of the amino acid sequences among 
known human olfactory receptors and GTAR14-3 . Shadowed residues are 



shared by at least three proteins out of four. Dotted residues are 
matched in all four proteins. Abbreviations are as follows: 

0LF1 for human olfactory receptor 1 (GenBank Accession* U5642 0 ) ; 
OLF2 for human olfactory receptor 2 (GenBank Accession* L35475); OLF3 
for human olfactory receptor 3 (GenBank Accession* L56421); and 14-3 
for GTAR14-3. 

Figure 3 shows a comparison of the amino acid sequences among 
known human olfactory receptors and GTAR14-4 and GTAR14-5 . Shadowed 
residues are shared by at least three proteins out of five. Dotted 
residues are matched in all five proteins. Abbreviations are as 
follows : 

OLF1 for human olfactory receptor 1 (GenBank Accession* U56420 ) ; 
OLF2 for human olfactory receptor 2 (GenBank Accession* L354 75) ; OLF3 
for human olfactory receptor 3 (GenBank Accession* L56421); 14-4 for 
GTAR14-4 (pseudogene) ; and 14-5 for GTAR14-5. 

Figure 4 shows the nucleotide sequence of GTAR14-1 which was 
obtained by genomic PCR. The positions of the primers used are 
underlined. The right-directed and left-directed arrows indicate 
the positions of 14-1-S1 primer, and 14-1-A1 primer, respectively. 

Figure 5 shows the nucleotide sequence of GTAR14-3 which was 
obtained by genomic PCR. The positions of the primers used are 
underlined. The right-directed and left-directed arrows indicate 
the positions of 14-3-S1 primer, and 14-3-A1 primer, respectively. 

Figure 6 shows the nucleotide sequence of GTAR14-5 which was 
obtained by genomic PCR . The positions of the primers used are 
underlined. The right-directed and left-directed arrows indicate 
the positions of 14-5-S1 primer, and 14-5-A1 primer, respectively. 

Figure 7 is a photograph showing the result of RT-PCR analysis 
of the gene expression pattern of GTAR14-1 in human tissues. 

A: Human Fetal MTC Panel (Clontech #K1425-1) , B: Human MTC Panel 
I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 

Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 
liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: heart, 10:brain, 11: placenta, 12: lung, 13: liver, 14: skeletal 
muscle, 15: kidney, 16: pancreas, 17: spleen, 18: thymus, 19: prostate, 
20 : testis, 21 : ovary, 22 : small intestine, 23 : colon, and 24 : peripheral 
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leukocytes . 

The left lane of each panel shows the result of the positive 
control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
5 of the control reaction in which G3PDH was amplified from a standard 
CDNA as a template using a primer set for G3PDH. 

Figure 8 is a photograph showing the result of RT-PCR analysis 
of the gene expression pattern of GTAR14-3 in human tissues. 

A: Human Fetal MTC Panel (Clontech #K1425-1) , B: Human MTC Panel 
10 I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 

Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 
liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: heart, 10:brain, 11: placenta, 12: lung, 13: liver, 14: skeletal 
muscle, 15: kidney, 16: pancreas, 17: spleen, 18: thymus, 19: prostate, 
15 20 : testis , 21 : ovary, 22 : small intestine, 23 : colon, and 24 : peripheral 
leukocytes . 

The left lane of each panel shows the result of the positive 
control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
20 of the control reaction in which G3PDH was amplified from a standard 
cDNA as a template using a primer set for G3PDH. 

Figure 9 is a photograph showing the result of RT-PCR analysis 
of the gene expression pattern of GTAR14-5 in human tissues.' 

A: Human Fetal MTC Panel (Clontech #K1425-1) , B: Human MTC Panel 
25 I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 

Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 
liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: heart, 10:brain, 11: placenta, 12: lung, 13: liver, 14: skeletal 
muscle, 15: kidney, 16: pancreas, 17: spleen, 18: thymus, 19: prostate, 
30 20 : testis , 21 : ovary, 22 : small intestine, 23 : colon, and 2 4 : peripheral 
leukocytes . 

The left lane of each panel shows the result of the positive 
control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
35 of the control reaction in which G3PDH was amplified from a standard 
cDNA as a template using a primer set for G3PDH. 



Figure 10 shows the nucleotide sequence of the full length 
GTAR14-1 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 '-RACE. The amino acid sequence encoded by the GTAR14-1 cDNA 
is also indicated. The amino acid sequences predicted to be a 
transmembrane domain are underlined (TM-I, -II, -III, -IV, and -V) . 

Figure 11 shows the nucleotide sequence of the full length 
GTAR14-1 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 ' -RACE , continued f romFigure 10 . The amino acid sequence encoded 
by the GTAR14-1 cDNA is also indicated. The amino acid sequences 
predicted to be a transmembrane domain are underlined (TM-VI and -VII ) . 

Figure 12 shows the nucleotide sequence of the full length 
GTARl 4 -3 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 '-RACE. The amino acid sequence encoded by the GTAR14-3 cDNA 
is also indicated. Amino acid sequences predicted to be a 
transmembrane domain are underlined (TM-I, -II, -III, -IV, and -V). 

Figure 13 shows the nucleotide sequence of the full length 
GTAR14-3 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 ' -RACE , continued f romFigure 12 . The amino acid sequence encoded 
by the GTARl 4-3 cDNA is also indicated . Amino acid sequences predicted 
to be a transmembrane domain are underlined (TM-VI and -VII). 

Figure 14 shows the nucleotide sequence of the full length 
GTAR14-5 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 '-RACE. The amino acid sequence encoded by the GTAR14-5 cDNA 
is also indicated. Amino acid sequences predicted to be a 

transmembrane domain are underlined (TM-I, -II, -III, and -IV). 

Figure 15 shows the nucleotide sequence of the full length 
GTARl 4 -5 cDNA that was obtained by integrating the products of 5 ' -RACE 
and 3 ' -RACE, continued f romFigure 14 . The amino acid sequence encoded 
by the GTAR14-5 cDNA is also indicated. minAo acid sequences predicted 
to be a transmembrane domain are underlined (TM-V, -VI, and -VII). 

Figure 16 shows the nucleotide sequence of GTARl 1-1 which was 
obtained by genomic PCR. The positions of the primers used are 
underlined. The right-directed and left-directed arrows indicate 
the positions of 11-1-S2 primer, and 11-1-A3 primer, respectively. 

Figure 17 shows the nucleotide sequence of GTARl 1-2 which was 
obtained by genomic PCR. The positions of the primers used are 
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underlined. The right-directed and left-directed arrows indicate 
the positions of 11-2-S2 primer, and 11-2-A2 primer, respectively. 

Figure 18 shows the nucleotide sequence of GTAR11-3 which was 
obtained by genomic PCR . The positions of primers used are underlined . 
5 The right-directed and left-directed arrows indicate the positions 
of 11-3-S2 primer, and 11-3-A2 primer, respectively. 

Figure 19 shows the nucleotide sequence of GTARll-4 which was 
obtained by genomic PCR. The positions of the primers used are 
underlined. The right-directed and left-directed arrows indicate 
10 the positions of 11-4-S2 primer, and 11-4-A2 primer, respectively. 

Figure 20 is a photograph showing the result of RT-PCR analysis 
of the gene expression pattern of GTAR11-1 in human tissues. 

A: Human Fetal MTC Panel (Clontech #K1425-1), B: Human MTC Panel 
I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 
15 Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 

liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: brain, 10: heart, 11: kidney, 12: liver, 13: lung, 14: pancreas, 
15: placenta, 16: skeletal muscle , 17: colon, 18: ovary, 19: peripheral 
leukocytes, 20: prostate, 21: small intestine, 22: spleen, 23: testis, 
20 and 24: thymus. 

The left lane of each panel shows the result of the positive 
control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
of the control reaction in which G3PDH was amplified from a standard 
25 cDNA as a template using a primer set for G3PDH. 

Figure 21 is a photograph showing the result of RT-PCR analysis 
of the gene expression pattern of GTAR11-2 in human tissues. 

A: Human Fetal MTC Panel (Clontech #K1425^1) , B: Human MTC Panel 
I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 
30 Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 

liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: brain, 10: heart, 11: kidney, 12: liver, 13: lung, 14: pancreas, 
15: placenta, 16: skeletal muscle , 17: colon, 18: ovary, 19: peripheral 
leukocytes, 20: prostate, 21: small intestine, 22: spleen, 23: testis, 
35 and 24: thymus. 

The left lane of each panel shows the result of the positive 
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control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
of the control reaction in which G3PDH was amplified from a standard 
cDNA as a template using a primer set for G3PDH. 
5 Figure 22 is a photograph showing the result of RT-PCR analysis 

of the gene expression pattern of GTAR11-3 in human tissues, 

A: Human Fetal MTC Panel (Clontech #K1425-1 ) , B: Human MTC Panel 
I (Clontech #K1420-1), C: Human MTC Panel II (Clontech #K1421-1). 

Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 
10 liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: brain, 10: heart, 11: kidney, 12: liver, 13: lung, 14: pancreas, 
15: placenta, 16: skeletal muscle, 17: colon, 18: ovary, 19: peripheral 
leukocytes, 20: prostate, 21: small intestine, 22: spleen, 23: testis, 
and 24: thymus. 

15 The left lane of each panel shows the result of the positive 

control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
of the control reaction in which G3PDH was amplified from a standard 
cDNA as a template using a primer set for G3PDH. 
20 Figure 23 is a photograph showing the result of RT-PCR analysis 

of the gene expression pattern of GTAR11-4 in human tissues. 

A: Human Fetal MTC Panel (Clontech #K1425-1), B: Human MTC Panel 
I (Clontech #K1420-1), C : Human MTC Panel II (Clontech #K1421-1). 

Lanes 1: fetal brain, 2: fetal heart, 3: fetal kidney, 4: fetal 
25 liver, 5: fetal lung, 6: fetal muscle, 7: fetal spleen, 8: fetal thymus, 
9: brain, 10: heart, 11: kidney, 12: liver, 13: lung, 14: pancreas, 
15: placenta, 16: skeletal muscle, 17: colon, 18: ovary, 19: peripheral 
leukocytes, 20: prostate, 21: small intestine, 22: spleen, 23: testis, 
and 24: thymus. 

30 The left lane of each panel shows the result of the positive 

control in which human genomic DNA was amplified as a template using 
the same primer set. The right lane of each panel shows the product 
of the control reaction in which G3PDH was amplified from a standard 
cDNA as a template using a primer set for G3PDH. 

35 Figure 24 shows the nucleotide sequence of the GTAR11-1 . cDNA 

that was obtained by integrating the products of 5 ' -RACE and 3 ' -RACE . 



- 34 - 



The amino acid sequence encoded by the GTAR11-1 cDNA is also indicated. 
Amino acid sequences predicted to be a transmembrane domain are 
underlined (TM-I, -II, -III, -IV, -V, -VI, and -VII). 

Figure 25 shows the nucleotide sequence of the GTAR11-2 cDNA 
5 that was obtained by integrating the products of 5 ' -RACE and 3 ' -RACE . 
The amino acid sequence encoded by the GTAR11-2 cDNA is also indicated. 
Amino acid equences predicted to be a transmembrane domain are 
underlined (TM-I, -II, -III, -IV, -V, -VI, and -VII). 

Figure 2 6 shows the nucleotide sequence of the GTAR11-3 cDNA 
10 that was obtained by integrating the products of 5 ' -RACE and 3 '-RACE, 
The amino acid sequence encoded by the GTAR11-3 cDNA is also indicated . 
Amino acid sequences predicted to be a transmembrane domain are 
underlined (TM-I, -II, -III, -IV, -V, -VI, and -VII). 

Figure 2 7 shows the nucleotide sequence of the GTAR11-4 cDNA 
15 that was obtained by integrating the products of 5 ' -RACE and 3 '-RACE. 
The amino acid sequence encoded by the GTARll-4 cDNA is also indicated. 
Amino acid sequences predicted to be a transmembrane domain are 
underlined (TM-I, -II, -III, -IV, -V, -VI, and -VII). 

Figure 2 8 shows comparison of the amino acid sequences among 
20 known human olfactory receptors and GTAR11-1 . Shadowed residues are 
shared by at least three proteins out of four. Dotted residues are 
matched in all four proteins. The N residue of the [N-x-S/T] motif 
( x , any amino acid residue ) that is modified by sugar , conserved cysteine 
residues that form a disulfide bond required for cross-linking, and 
25 theD-R-Y sequence in the [M-A-Y-D-R-Y-L/V-A-I/v-C ] motif in the second 
intracellular domain, which is suggested to be essential for the binding 
to intracellular G proteins, are underlined. Abbreviations are as 
follows : 

OLF1 for human olfactory receptor 1 (GenBank Accession* U56420 ) ; 
30 OLF2 for human olfactory receptor 2 (GenBank Accession* L35475) ; OLF3 
for human olfactory receptor 3 (GenBank Accession* L56421) ; and 11-1 
for GTAR11-1. 

Figure 2 9 shows comparison of the amino acid sequences among 
known human olfactory receptors and GTAR11-2. Dotted residues are 
35 matched in all three proteins. The N residue of [N-x-S/T] motif (x, 
any amino acid residue) that is modified by sugar, conserved cysteine 
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residues that form a disulfide bond required for cross-linking, and 
the D-R-Y sequence in [M-A-Y-D-R-Y-L/V-A-I/V-C ] motif in the second 
intracellular domain, which is suggested to be essential for the binding 
to intracellular G proteins, are underlined. Abbreviations are as 
5 follows: 

0LF2 for human olfactory receptor 2 (GenBank Accession* L354 75 ) ; 
0LF3 for human olfactory receptor 3 (GenBank Accession! L56421); and 
11-2 for GTARll-2. 

Figure 30 shows comparison of the amino acid sequences among 

10 known human olfactory receptors and GTAR11-3- Dotted residues are 
matched in all three proteins. The N residue of [N-x-S/T] motif (x, 
any amino acid residue) that is modified by sugar, conserved cysteine 
residues that form a disulfide bond required for cross-linking, and 
the D-R-Y sequence in [M-A-Y-D-R-Y-L/V-A-I/V-C] motif in the second 

15 intracellular domain, which is suggested to be essential for the binding 
to intracellular G proteins, are underlined. Abbreviations are as 
follows : 

OLF2 for human olfactory receptor 2 (GenBank Accession* L35475 ) ; 
OLF3 for human olfactory receptor 3 (GenBank Accession* L56421); and 

20 11-3 for GTAR11-3. 

Figure 31 shows comparison of the amino acid sequences among 
known human olfactory receptors and GTAR11-4. Dotted residues are 
matched in all three proteins. The N residue of [N-x-S/T] motif (x, 
any amino acid residue) that is modified by sugar, conserved cysteine 

25 residues that form a disulfide bond required for cross-linking, and 
the D-R-Y sequence in [M-A-Y-D-R-Y-L/V-A-I/V-C ] motif in the second 
intracellular domain, which is suggested to be essential for the binding 
to intracellular G proteins, are underlined. Abbreviations are as 
follows : 

30 OLF2 for human olfactory receptor 2 (GenBank Accession* L35475 ) ; 

OLF3 for human olfactory receptor 3 (GenBank Accession* L56421); and 
11-4 for GTAR11-4. 



Best Mode for Carrying Out the Invention 



35 Example 1. Isolation and analysis of GTAR14 genes 



( 1 ) BLAST search 
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BLAST search was performed on the Non-redundant Sequence (nr) 
database in GenBank, and a human genomic sequence that contains a 
gene encoding a novel seven transmembrane G protein-coupled receptor, 
which has a high structural homology to known olfactory receptor (OR) 
5 gene family, was found. The gene is contained in the sequence of 
BAC129 clone (GenBank Accession* U85195), originating from the 
vicinity of the a/6 T-cell receptor locus on human chromosome 14, and 
shows about 50 to 60% homology to known human olfactory receptors, 
OLF1, OLF2, and OLF3 at the amino acid sequence level. 

10 Based on the results of analysis of gene expression pattern as 

described below and prediction of function, the three isolated genes 
were named G protein-coupled T-cell Activating Receptor 14-1 
(GTAR14-1), G protein-coupled T-cell Activating Receptor 14-3 
(GTAR14-3), and G protein-coupled T-cell Activating Receptor 14-5 

15 (GTAR14-5), respectively. 

The genomic structures of known OR gene family have a unique 
feature in that the ORF from the initiation codon to the stop codon 
is contained in a single exon of the genomic sequence encoding the 
known OR gene. The full length ORFs of GTAR14-1, GTAR14-3, and 
20 GTAR14-5 are also contained in a sequence that is supposed to be a 
single exon. 

Within the sequence of the BAC129 clone, other novel OR-like 
genes were also found, which were named GTAR1 4 -2 and GTAR 14-4 . However, 
in order to obtain identity in amino acid sequence to other OR gene 

25 family proteins, the ORF of GTAR14-2 must be split in two by deletion 
of a single nucleotide in the middle. On the other hand, by comparing 
sequences of GTAR14-4 with other OR genes to obtain identity at the 
amino acid level, the ORF of GTAR14-4 has to be divided into two portions 
by an in-frame stop codon in the second extracellular domain ( loop-1 ) . 

30 Accordingly, these genes were judged to be a typical pseudogene of 
the OR gene family, and were excluded from further study. 

In Figure 1, comparison of the amino acid sequences of known 
human OLF1, OLF2 , and OLF3, and proteins of the present invention, 
GTAR14-1 and GTAR14-2, is shown. The sequence of GTAR14-2 is of an 

35 artificial translational frame created by inserting a single 
nucleotide at the position of the missing nucleotide. 



Figure 2 shows comparison of the amino acid sequences among known 
human 0LF1, OLF2 , and 0LF3 , and protein of the present invention, 
GTAR14-3 . 

Figure 3 shows comparison of the amino acid sequences among known 
human 0LF1, 0LF2 , and 0LF3 , and proteins of the present invention, 
GTAR14-4 and GTAR14-5. The amino acid sequence of GTAR14-4 is 
artificially created by reading through the in frame stop codon. 

(2) Isolation of the genes of GTAR14-1, GTAR14-3, and GTAR14-5 
by genomic PCR 

The sequence of BAC129 clone U85195 was analyzed for the presence 
of an exon that is broadly conserved in known OLF receptors at the 
amino acid sequence level, and the predicted exon was used to design 
primers as follows. 

For amplification of GTAR14-1, GTAR14-1-S1 (5' -ATG GAC AGT CTA 
AAC CAA ACA AGA GTG-3 ' ; SEQ ID NO: 7), GTAR14-1-S2 ( 5 ' -ATG GCA TTC 
TCA GCC ATT TAT ATG CTA-3 ' ; SEQ ID NO: 8), and GTAR14-1-S3 ( 5 ' — GGG 
AAC ATT CTC ATC ATC ATT GCC ACA-3 ' ; SEQ ID NO: 9) were used as sense 
primers, and GTAR14-1-A1 (5'-TTA TGT ATA TGA TTT CGT GAA AAA AAC-3 ' ; 
SEQ ID NO: 10 ) , GTAR14-1-A2 ( 5 ' -TAC CTC CTC ATT CCT CAA GGT GTA AAT-3 ' ; 
SEQ ID NO: 11), and GTAR14-1-A3 ( 5 ' -GGT GAC CAC TGT GTA GAA GAC AGA 
CAC-3 ' ; SEQ ID NO: 12) were used as antisense primers. 

For amplification of GTAR14-3, GTAR14-3-S1 ( 5 ' -ATG GAA AGA ATC 
AAC AGC ACA CTG TTG-3 ' ; SEQ ID NO: 13), and GTAR14-3-S2 ( 5 ' -TCT AAT 
CTA CAT CCT GAC TCA GCT GGG- 3 ' ; SEQ ID NO: 14) were used as sense 
primers, and GTAR14-3-A1 (5'-TTC AAA CCT CAC TCG GAG TTC TTG GGC-3 ' ; 
SEQ ID NO: 15), and GTAR14-3-A2 (5 7 -AGC TTC ACC TCT TGG TTC CGC AGA 
GTG-3'; SEQ ID NO: 16) were used as antisense primers. 

For amplification of GTAR14-5, GTAR14-5-S1 ( 5 ' —ATG GGA AAG ACC 
AAA AAC ACA TCG CTG-3 ' ; SEQ ID NO: 17), and GTAR14-5-S2 (5'-CGT GGT 
GAC AGA TTT CAT TCT TCT GGG- 3 ' ; SEQ ID NO : 18) were used as sense 
primers, and GTAR14-5-A1 (5'-TCA ACC TGC TGT TAT CCT CTT CAG GGC-3'; 
SEQ ID NO: 19), and GTAR14-5-A2 ( 5 ' -CCT GGT TCC TCA GTG TAT AGA TGA 
GGG- 3 ' ; SEQ ID NO: 20) were used as antisense primers. 

Genomic PCR was performed using Human Genomic DNA (Clontech 
#6550-1) as a template, and primer sets of "14-1-Sl and 14-1-Al," 
" 14-1-S1 and 14-1-A2 , " " 14-1-S2 and 14-1-Al , " and "14-1-S2 and 14-1-A2" 
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for amplification of GTAR14-1, "14-3-S1 and 14-3-A1," "14-3-S1 and 
14-3-A2," ''14-3-32 and 14-3-Al," and "14-3-S2 and 14-3-A2" for 
amplification of GTAR14-3, and "14-5-S1 and 14-5-A1," "14-5-S1 and 
14-5-A2," "14-5-S2 and 14-5-Al," and "14-5-S2 and 14-5-A2" for 
5 amplification of GTAR14-5. PCR was performed using the Advantage 
cDNA Polymerase Mix (Clontech #8417-1) on a thermal cycler (Perkin 
Elmer Gene Amp PCR System 24 00 ) . By performing PCR under the following 
conditions, a single band was observed with a predicted size for each 
primer set. 

10 PCR condition: 94°C for 4 min, 5 cycles of 94°C for 20 sec and 

72°C for 2 min, 5 cycles of 94°C for 20 sec and 70°C for 2 min, 28 
cycles of 94°C for 20 sec and 68°C for 2 min, 72°C for 4 min, and 
termination at 4°C. 

The obtained products of genomic PCR were subcloned into the 

15 pGEM-T Easy vector (Promega #A1360), and the nucleotide sequence was 
determined. The PCR products were cloned into the pGEM-T Easy vector 
by ligation using T4 DNA ligase (Promega #A1360) at 4°C for 12 hr. 
The sample was used to transform E. coll DH5a (Toyobo # DNA-903) to 
obtain recombinants of the PCR products and the pGEM-T Easy vector, 

20 Recombinants were selected by using the Insert Check Ready (Toyobo 
#PIK-101). The nucleotide sequence was determined by using the 
dRhodamine Terminator Cycle Sequencing kit (ABI/ Perkin Elmer #4303141 ) 
on a DNA Sequencer (ABI PRISM 377). 

Ten independent clones of recombinants were selected for each 

25 gene from the products of genomic PCR that had been performed using 
a set of primers that are located in the most external sites of the 
target gene fragment; that is, the product of the primer set of "14-1 -SI 
and 14-1-A1" for GTAR14-1 gene, that of "14-3-S1 and 14-3-Al" for 
GTAR14-3 gene, and that of "14-5-S1 and 14-5-Al" for GTAR14-5 gene, 

30 The complete nucleotide sequence of the inserts of all the clones 
were determined. 

As a result, all clones of GTAR14-1 gene and GTAR14-3 gene showed 
a single nucleotide sequence of 942 bp. All clones of GTAR14-5 
exhibited a single nucleotide sequence of 933 bp. The obtained 

35 sequences were verified to be a partial nucleotide sequence of 
respective genes and confirmed to not be a non-specific product of 
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genomic PCR. Accordingly, these primer sets are expected to be able 
to work correctly in RT-PCR as described in the following (3) . The 
partial nucleotide sequences of GTAR14-1, GTAR14-3, and GTAR14-5, 
obtained by genomic PCR, are shown in Figures 4, 5, and 6, and SEQ 
5 ID NO: 21, 22, and 23, respectively. 

(3) Analysis of tissue distribution and patterns of GTAR14-1 
GTAR14-3, and GTAR14-5 gene expressions by RT-PCR 

For analysis of the tissue distribution and patterns of GTAR14-1 , 
GTAR14-3, and GTAR14-5 gene expressions in human tissues, RT-PCR was 

10 performed using a primer set of "14-1-S2 and 14-1-A2," "14-3-S2 and 
14-3-A2," and "14-5-S2 and 14-5-A2," respectively. 

Human Fetal Multiple Tissue cDNAPanel (Clontech #K1425-1 ) , Human 
Multiple Tissue cDNA Panel I (Clontech #K1420-1) , and Human Multiple 
Tissue cDNA Panel II (Clontech #K1421-1) were used as a template. 

15 PCR was performed by using the Advantage cDNA Polymerase Mix (Clontech 
#8417-1) on a thermal cycler (Perkin Elmer Gene Amp PCR System 2400) . 
PCR reactions were performed under conditions with a single cycle 
of 94°C for 4 min, sequentially followed by 5 cycles of 94°C for 20 
sec and 72°C for 2 min, 5 cycles of 94°C for 20 sec and 70°C for 2 

20 min, 38 cycles of 94°C for 20 sec and 68°C for 2 min, a single cycle 
of 72°C for 4 min, and termination at 4°C. 

The result showed that a strong amplification of GTAR14-1 mRNA 
was observed only in thymus among human fetal tissues, and also in 
lymphatic /hematopoietic tissues including thymus and spleen, and 

25 testis and pancreas among adult tissues (Figure 7). 

GTAR14-3 was strongly amplified in thymus, brain, heart, smooth 
muscle, and kidney among human fetal tissues, and weakly expressed 
in liver and lung as well. GTAR14-3 was strongly detected in thymus 
and testis among adult tissues, and also expressed in digestive organs 

30 such as small intestine and colon , and endocrine organs such as placenta 
and prostate, showing a broad distribution among tissues (Figure 8) . 

GTAR14-5 was strongly amplified in thymus , spleen, and lung among 
human fetal tissues, and weakly expressed in brain and kidney. It 
was detected strongly in lymphatic /hematopoietic tissues such as 

35 thymus, spleen, and peripheral leukocytes, and testis, and pancreas 
in adult tissues (Figure 9). 
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The products of PCR were subcloned into the pGEM-T Easy vector 
(Promega #A1360) , and the nucleotide sequence were determined. The 
PCR products were cloned into the pGEM-T Easy vector by ligation using 
T4 DNA ligase (Promega #A1360) at 4°C for 12 hr. The samples were 

5 used to transforms, coli DH5a (Toyobo # DNA-903 ) to obtain recombinants 
of the PCR products and the pGEM-T Easy vector. Recombinants were 
selected by using the Insert Check Ready (Toyobo #PIK-101). The 
nucleotide sequence were determined by using the dRhodamine Terminator 
Cycle Sequencing kit (ABI/Perkin Elmer #4303141) on a DNA Sequencer 

10 (ABI PRISM 377) . For each gene, ten independent recombinant clones 
were selected, and the complete nucleotide sequence of the inserts 
were determined. As a result, all clones showed a single nucleotide 
sequence . 

The obtained sequences were verified to be a partial sequence 
15 of the GTAR14-1, GTAR14-3, or GTAR14-5, and confirmed not to be a 
non-specific product of RT-PCR. 

(4) Full length cDNA cloning by 5'- and 3 ' -RACE 
( i ) 5 ' -RACE 

5'-RACE-PCR was performed to isolate full length cDNAs of 
20 GTAR14-1, GTAR14-3, and GTAR14-5. Primers used for amplification 
of GTAR14-1, GTAR14-3, and GTAR14-5 genes were "14-1-Al," "14-3-Al," 
and " 14-5-A1" for the first PCR, and " 14-1-A2 , " " 14-3-A2 , " and " 14-5-A2" 
for the second PCR, respectively. 

Human Testis Marathon-Ready cDNA Library (Clontech #7414-1) was 
25 used as a template, and the Advantage cDNA Polymerase Mix was used 
for PCR. PCR was performed on a thermal cycler (Perkin Elmer Gene 
Amp PCR System 2400) under the following conditions: 

first PCR: a single cycle of 94°C for4min, sequentially followed 
by 5 cycles of 94°C for 20 sec and 72°C for 2 min, 5 cycles of 94°C 
30 for 20 sec and 70°C for 2 min, 30 cycles of 94°C for 20 sec and 68°C 
for 2 min, and a single cycle of 72°C for 4 min followed by termination 
at 4°C; 

secondPCR: a single cycle of 94°C for 4 min, sequentially followed 
by 5 cycles of 94°C for 20 sec and 72°C for 2 min, 28 cycles of 94°C 
35 for 20 sec and 68°C for 2 min, and a single cycle of 72°C for 4 min 
followed by termination at 4°C. 
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The obtained 5'-RACE-PCR products, which showed a single band, 
were subcloned into the pGEM-T Easy vector as described above. For 
six independent clones of recombinants, the nucleotide sequence of 
all inserts were determined by using the dRhodamine Terminator Cycle 
5 Sequencing kit (ABI/Perkin Elmer #4303141) on a DNA Sequencer (ABI 
PRISM 377), as described above. 

(ii) 3 '-RACE 

3'-RACE-PCR was further performed to isolate full length cDNAs 
of GTAR14-1, GTAR14-3, and GTAR14-5 genes. Primers used for 

10 amplification of GTAR14-1, GTAR14-3, and GTAR14-5 genes were 
"14-1-S1," "14-3-S1," and "14-5-S1" for first PCR, and "14-1-S2," 
"14-3-S2," and "14-5-S2" for second PCR, respectively. 

Again, Human Testis Marathon-Ready cDNA Library (Clontech 
#7414-1 ) was used as a template. PCR was performed using the Advantage 

15 cDNA Polymerase Mix under similar conditions to that for 5 ' -RACE-PCR 
described above, and PCR products of a single size were obtained. 
As described above, the obtained PCR products for respective genes 
were subcloned into the pGEM-T Easy vector . For six independent clones 
of recombinants , the nucleotide sequence of all inserts were determined 

20 by using the dRhodamine Terminator Cycle Sequencing kit (ABI/Perkin 
Elmer #4303141) on a DNA Sequencer (ABI PRISM 377). For each gene, 
all six independent clones showed a single nucleotide sequence. 
Finally, the nucleotide sequences obtained by 5 ' -RACE and 3 '-RACE 
were combined to determine the nucleotide sequence of full length 

25 cDNA of GTAR14-1 , GTAR14-3 , and GTAR14-5 . The determined nucleotide 
sequences of full length cDNAs are shown in Figures 10 and 11 (GTAR14-1 ) , 
Figures 12 and 13 (GTAR14-3), and Figures 14 and 15 (GTAR14-5), and 
SEQ ID NO: 1 (GTAR14-1), SEQ ID NO: 2 (GTAR14-3), and SEQ ID NO: 3 
(GTAR14-5 ) . 

30 

Example 2. Isolation and analysis of GTAR11 genes 
(1) BLAST search 

When members of known olfactory receptor (OR) gene family are 
structurally compared one another, they are found to share a high 
35 homology in the second and seventh transmembrane domains at the amino 
acid sequence level. Accordingly, among these two domains, the 
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present inventors chose a conserved amino acid sequence 
[LHTPMYFFLSNLSF] within the second transmembrane domain for query 
in BLAST search on the GenBank High Throughput Genomic Sequence (htgs) 
database, and tried to find a sequence of a novel useful gene belonging 
5 to the OR gene family . BLAST search was performed using default values 
of the TblastN ( Ver . 2 .0.5) program. Besides detecting many sequences 
containing a known OR genes as a positive control, multiple human 
genomic sequences having a high structural homology to known OR genes 
were found. These sequences are found in a sequence of BAC clone 

10 originating from human chromosome 11 (GenBank Accession* AC002556) , 
and show about 30 to 4 0% homology to known human olfactory receptor 
1 (0LF1), OLF2 , and OLF3 at the amino acid sequence level. Based 
on the results of analysis of gene expression pattern as described 
below and prediction of function, the four discovered genes were named 

15 G protein-coupled T-cell Activating Receptor 11-1 (GTAR11-1), G 
protein-coupled T-cell Activating Receptor 11-2 (GTARll-2), G 
protein-coupled T-cell Activating Receptor 11-3 (GTAR11-3), and G 
protein-coupled T-cell Activating Receptor 11-4 (GTAR11-4), 
respectively . 

20 (2) Isolation of GTAR11 genes by genomic PGR 

The sequence of the BAC clone AC002556 was analyzed for the 
presence of an exon that is broadly conserved in known OLF receptors 
at the amino acid sequence level, and the exon predicted was used 
to design primers as follows. 

25 For amplification of GTAR11-1, GTAR11-1-S1 ( 5 ' -GAA GAG CAG TGA 

GGG TCC ATG TTA AGG-3 ' ; SEQ ID NO: 34), and GTAR11-1-S2 ( 5 ' -CAG CAG 
CTT GTC CTT CGT CGA TTT CTG C-3 ' ; SEQ ID NO: 35) were used as sense 
primers , and GTAR11-1-A2 ( 5 ' -GCT AGG GTG GGC ACC AAG GTG TTA AAC CC-3 ' ; 
SEQ ID NO: 36), and GTAR11-1-A3 ( 5 ' -TGC AAA AGG ACA GTT TCA TCA TGG 

30 CAC-3'; SEQ ID NO: 37) were used as antisense primers. 

For amplification of GTARll-2, GTAR11-2-S1 (5'-CAA AGA ACT CAC 
CCA AAT TCC TAC AGC T-3 ' ; SEQ ID NO: 38), and GTAR11-2-S2 ( 5 ' -CAT 
GGT AGG CAA CCT TGG CTT GAT CAC-3' ; SEQ ID NO : 39) were used as sense 
primers , and GTAR1 1-2-A1 ( 5 ' -GTT TAT TAA ATC ACA CAT AAC ACC ATC TG-3 ' ; 

35 SEQ ID NO: 40), and GTAR11-2-A2 ( 5 ' -CAG AGA CAG AGC AAT GAC ATG AGA 
GCT AC-3'; SEQ ID NO: 41) were used as antisense primers. 
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For amplification of GTAR11-3, GTAR11-3-S1 ( 5 ' -CAA AGA ACT CAC 
CCA AAT TCC TAC AGC C-3 ' ; SEQ ID NO: 42), and GTAR11-3-S2 ( 5 ' -CAT 
GGT AGG CAA CCT TGG CTT GAT CAT-3 ' ; SEQ ID NO: 43) were used as sense 
primers , and GTAR1 1-3-A1 ( 5 ' — GTT TAT TAA ATC ACA CAT AAC ACC ATC TG-3 ' ; 
5 SEQ ID NO: 44), and GTAR11-3-A2 ( 5 ' -CAG AGA CAG AGC AAT GAC ATG AGA 
GCT AC-3'; SEQ ID NO: 45) were used as antisense primers. 

For amplification of GTAR11-4, GTAR11-4-S1 ( 5 ' -CCA GAC AGC TCG 
CCA AGA GAG AAT GAC-3 ' ; SEQ ID NO: 46), and GTAR11-4-S2 ( 5 ' -CCT TTA 
TAG ATC TCT GTT ATT CCT GTG TG-3' ; SEQ ID NO: 47) were used as sense 
10 primers, and GTAR11-4-A1 (5' -TCG GTT GCC AGT GAT ATG AAG AGA CCC-3 ' ; 
SEQ ID NO: 48), and GTAR11-4-A2 ( 5 ' -GGC TTT GGA TCT GCC CTC TGC AGA 
AGG- 3 ' ; SEQ ID NO: 49) were used as antisense primers. 

Genomic PCR was performed using Human Genomic DNA (Clontech 
#6550-1) as a template, and primer sets of "11-1-S1 and 11-1-A2," 

15 "11-1-S1 and 11-1-A3," "11-1-S2 and 11-1-A2, " and "11-1-S2 and 11-1-A3" 
for amplification of GTARll-1, "11-2 -SI and 11-2-A1," "11 -2 -SI and 
11-2-A2," "11-2-S2 and 11-2-A1," and "11-2-S2 and 11-2-A2" for 
amplification of GTAR11-2, "11-3-S1 and 11-3-A1," "11-3-S1 and 
11-3-A2," "11-3-S2 and 11-3-Al," and "11-3-S2 and 11-3-A2" for 

20 amplification of GTAR11-3, and "11-4-S1 and 11-4-A1," "11-4-S1 and 
11-4-A2," "11-4-S2 and 11-4-Al," and "11-4-S2 and 11-4-A2" for 
amplification of GTARll-4. PCR was performed using the Advantage 
cDNA Polymerase Mix (Clontech #8417-1) on a thermal cycler (Perkin 
Elmer Gene Amp PCR System 2400) . Following conditions were used for 

25 PCR; a single cycle of 94°C for 4 min, sequentially followed by 5 
cycles of 94°C for 20 sec and 72°C for 2 min, 5 cycles of 94°C for 
20 sec and 70°C for 2 min, 28 cycles of 94°C for 20 sec and 68°C for 
2 min, and then a single cycle of 72°C for 4 min followed by termination 
at 4°C. As a result, a single band was detected with predicted size 

30 for each primer set. 

The products obtained in the above genomic PCR were subcloned 
into the pGEM-T Easy vector (Promega #A1360), and the nucleotide 
sequence were determined- PCR products were cloned into the pGEM-T 
Easy vector by ligation using T4 DNA ligase (Promega #A1360) at 4°C 

35 for 12 hr. The samples were used to transform coli DH5a (Toyobo 
# DNA-903) to obtain recombinants of the PCR products and the pGEM-T 
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Easy vector. Recombinants were selected by using the Insert Check 
Ready (Toyobo #PIK-101). The nucleotide sequences were determined 
by using the dRhodamine Terminator Cycle Sequencing kit (ABI/Perkin 
Elmer #4303141) on a DNA Sequencer (ABI PRISM 377). 
5 For each gene, ten independent clones of recombinants were 

selected from the products of genomic PGR that had been performed 
using a set of primers that are located within the ORF of the target 
gene fragment; that is the product of the primer set of "11-1-S2 and 
11-1-A3" for GTARl 1-1 gene, that of "11-2-S2 and 11-2-A2" for GTARll-2 

10 gene, that of "11-3-S2 and 11-3-A2" for GTARll-3 gene, and that of 
»H_4_S2 and 11-4-A2" for GTARl 1-4 gene. The nucleotide sequence 
of all the inserts were determined. 

As a result, a single nucleotide sequence of 450 bp was obtained 
from all clones of GTARll-1 gene. From all clones of GTARll-2 and 

15 GTARll-3 genes, a single nucleotide sequence of 637 bp was obtained. 
A single nucleotide sequence of 509 bp was obtained from all clones 
of GTARl 1-4 gene. The obtained sequences were verified to be a partial 
nucleotide sequence of the respective genes to confirm that they are 
not a non-specific product of genomic PGR. Accordingly, these primer 

20 sets are expected to work correctly in RT-PCR described in the following 
(3). The partial nucleotide sequences of GTARll-1, GTARll-2, 
GTARll-3 , and GTARl 1-4 obtained by the genomic PCR are shown in Figures 
16, 17, 18, and 19, and SEQ ID NO: 50, 51, 52, and 53, respectively. 

(3) Analysis of tissue distribution and patterns of GTARl 1 gene 

25 expressions by RT-PCR 

For analysis of the tissue distribution and patterns of GTARl 1 
gene expressions in human tissues, RT-PCR was performed using a primer 
set of "11-1-S2 and 11-1-A2," "11-2-S2 and 11-2-A2," "11-3-32 and 
11-3-A2," and "11-4-S2 and 11-4-A2" for GTARll-1, GTARll-2, GTARll-3, 

30 and GTARl 1-4 genes, respectively. 

Human Fetal Multiple Tissue cDNAPanel ( Clontech #K1425-1 ) , Human 
Multiple Tissue cDNA Panel I (Clontech #K1420-1) , and Human Multiple 
Tissue cDNA Panel II (Clontech #K1421-1) were used as a template.^ 
PCR was performed by using the Advantage cDNA Polymerase Mix (Clontech 

35 #8417-1) on a thermal cycler (Perkin Elmer Gene Amp PCR System 2400) . 
PCR reactions were performed under conditions with a single cycle 



of 94°C for 4 min, sequentially followed by 5 cycles of 94°C for 20 
sec and 72°C for 2 min, 5 cycles of 94°C for 20 sec and 70°C for 2 
min, 38 cycles of 94°C for 20 sec and 68°C for 2 min, and then a single 
cycle of 72°C for 4 min followed by termination at 4°C. 

The result showed a strong amplification of GTAR11-1 mRNA in 
spleen and weak expression in brain, lung, and smooth muscle among 
human fetal tissues. GTARll-1 gene was also expressed in multiple 
tissues in adult including lymphatic /hematopoietic tissues such as 
thymus, spleen, and peripheral lymphocytes, digestive organs such 
as small intestine and colon, genital organs, neuronal tissues, and 
endocrine organs (Figure 20). 

GTAR11-2 gene was strongly amplified in thymus, and weakly 
expressed in spleen, smoothmuscle, lung, and kidney among fetal tissues . 
GTAR11-2 gene was also detected in multiple tissues in adult including 
lymphatic /hematopoietic tissues such as thymus and peripheral 
lymphocytes, digestive organs, genital organs, neuronal tissues, and 
endocrine organs (Figure 21). 

GTAR11-3 gene was strongly amplified in smoothmuscle among human 
fetal tissues, and weakly expressed in brain, lung, and liver as well. 
The gene was also detected in multiple tissues in adult including 
lymphatic /hematopoietic tissues such as thymus , spleen, and peripheral 
lymphocytes , digestive organs such as small intestine and colon , muscle , 
genital organs, neuronal tissues, and endocrine organs (Figure 22). 

GTAR11-4 gene was strongly amplified only in smooth muscle in 
fetus. It was also expressed in multiple tissues in adult including 
lymphatic /hematopoietic tissues such as thymus, spleen, and peripheral 
lymphocytes , digestive organs such as small intestine and colon , muscle , 
genital organs, and endocrine organs (Figure .23) . 

The obtained PCR products were subcloned into the pGEM-T Easy 
vector (Promega #A1360) , and the nucleotide sequence were determined. 
The PCR products were cloned into the pGEM-T Easy vector by ligation 
using T4 DNA ligase (Promega #A1360) at 4°C for 12 hr. The samples 
were used to transform E. coli DH5a (Toyobo # DNA-903) to obtain 
recombinants of the PCR products and the pGEM-T Easy vector. 
Recombinants were selected by using the Insert Check Ready (Toyobo 
#PIK-101). The nucleotide sequence were determined by using the 
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dRhodamine Terminator Cycle Sequencing kit ( ABI /Per kin Elmer #4303141 ) 
on a DNA Sequencer (ABI PRISM 377). For each gene, ten independent 
clones of recombinants were selected, and the nucleotide sequence 
of all the inserts were determined. All clones of each gene showed 
5 a single nucleotide sequence. 

The obtained sequences were verified to be a partial sequence 
of GTAR11-1, GTAR11-2, GTAR11-3, and GTAR11-4 genes, and confirmed 
not to be a non-specific product of RT-PCR. 

(4) Cloning of full length cDNA by 5'- and 3 '-RACE 
10 ( i ) 5 ' -RACE 

To isolate full length cDNAs of GTAR11 genes, 5 '-RACE-PCR was 
performed. Primers used for amplification of GTARll-1, GTAR11-2, 
GTAR11-3, and GTARll-4 genes were "11-1-A2," "11-2-A1," "11-3-Al," 
and "11-4-Al" for first PCR, and "11-1-A3," "11-2-A2," "11-3-A2," 
15 and "11-4-A2" for second PCR, respectively. 

Human Testis Marathon-Ready cDNA Library (Clontech #7414-1) was 
used as a template, and the Advantage cDNA Polymerase Mix was used 
for PCR. PCR was performed on a thermal cycler (Perkin Elmer Gene 
Amp PCR System 2400) under the following conditions: 
20 first PCR: a single cycle of 94°C for 4 min, sequentially followed 

by 5 cycles of 94°C for 20 sec and 72°C for 2 min, 5 cycles of 94°C 
for 20 sec and 70°C for 2 min, 30 cycles of 94°C for 20 sec and 68°C 
for 2 min, and a single cycle of 72°C for 4 min followed by termination 
at 4°C; 

25 second PCR: a single cycle of 94°C for 4 min , sequentially followed 

by 5 cycles of 94°C for 20 sec and 72°C for 2 min, 28 cycles of 94°C 
for 20 sec and 68°C for 2 min, and a single cycle of 72°C for 4 min 
followed by termination at 4°C. 

The obtained 5 ' -RACE-PCR products, detected as a single band, 

30 were subcloned into the pGEM-T Easy vector, six independent clones 
of recombinants were selected for each, and the nucleotide sequence 
of all inserts were determined by using the dRhodamine Terminator 
Cycle Sequencing kit (ABI/Perkin Elmer #4303141) on a DNA Sequencer 
(ABI PRISM 377), as described above. 

35 (ii) 3 '-RACE 

3 ' -RACE-PCR was further performed to isolate full length cDNAs 
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of GTAR11 genes . Primers used for amplification of GTARll-1 , GTAR11-2 , 
GTARll-3, and GTARll-4 genes were "11-1-S1," "11-2-S1," "11-3-S1," 
and "11-4-S1" for first PCR, and "11-1-S2," "11-2-S2," "11-3-S2," 
and "11-4-32" for second PCR, respectively. 
5 Again, Human Testis Marathon-Ready cDNA Library (Clontech 

#7414-1 ) was used as a template . PCR was performed using the Advantage 
cDNA Polymerase Mix under similar conditions to that of 5'-RACE-PCR 
described above, and PCR products of a single size were obtained. 
As described above, the obtained products were subcloned into the 

10 pGEM-T Easy vector, six independent clones of recombinants were 
selected for each, and the nucleotide sequence of all inserts were 
determined by using the dRhodamine Terminator Cycle Sequencing kit 
(ABI/Perkin Elmer #4303141) on a DNA Sequencer (ABI PRISM 377) . All 
six clones of each gene showed a single sequence. The nucleotide 

15 sequences obtained by 5 '-RACE and 3 ' -RACE were combined to generate 
the sequence of full length cDNA of each GTARll gene* The resulting 
nucleotide sequences of cDNAs are shown in Figure 24 (GTARll-1 ) , Figure 
25 (GTAR11-2), Figure 26 (GTAR11-3), and Figure 27 (GTARll-4), and 
SEQ ID NO: 2 4 (GTARll-1), SEQ ID NO: 25 (GTAR11-2), SEQ ID NO: 2 6 

20 (GTAR11-3), and SEQ ID NO: 27 (GTARll-4). 

The GTAR1 1-1 cDNA obtained by the first 3 ' -RACE-PCR had a deletion 
at the C-terminus , and thus was not full length ( the nucleotide sequence 
of the cDNA and the amino acid sequence of the peptide encoded by 
the cDNA are shown in SEQ ID NO: 32, and SEQ ID NO: 33, respectively) . 

25 Consequently, 3 ' -RACE-PCR was performed again for GTARll-1, and the 
full length cDNA of GTARll-1 was obtained as described above. 

The genome of known OR gene family shares a unique structural 
feature that the ORF from the initiation codon to the stop codon is 
contained in a single exon of the genome sequence encoding known OR 

30 genes. The isolated full length cDNAs of GTAR1 1-1 , GTAR11-2, GTARll-3, 
and GTARll-4 genes also have their whole ORF within a sequence that 
is supposed to be a single exon. 

Figure 2 8 shows comparison of the amino acid sequences of known 
human 0LF1, OLF2 , and OLF3, and GTARll-1. Figure 29 shows comparison 

35 of the amino acid sequences of known human OLF2 and OLF3, and GTARll-2 . 
Figure 30 shows comparison of the amino acid sequences of known human 
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0LF2 and 0LF3, and GTAR11-3. Figure 31 shows comparison of the amino 
acid sequences of known human 0LF2 and 0LF3, and GTAR11-4. 

Industrial Applicability 
5 The present invention provides novel G protein-coupled receptor 

proteins and their genes. According to their particular expression 
patterns and such, the receptor proteins of the present invention 
appear to be involved in immune response, hematopoiesis , etc. Thus, 
one can use these proteins in screening assays, for developing drugs 
10 that can regulate immune response and/or hematopoietic cells. Using 
the receptor proteins of the present invention, one can also screen 
for ligands capable of binding to the receptor proteins of the invention , 
such ligands also being potentially involved in immune response, 
hematopoiesis, and so on. 
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CLAIMS 



1 . AG protein-coupled receptor protein selected f romthe group 
consisting of: 

5 (a) a protein having an amino acid sequence selected from the 

group consisting of SEQ ID NO: 4, 5, 6, 28, 29, 30, and 31; 

(b) a protein of ( a ), wherein one or more amino acids are modified 
by deletion, addition, insertion, and/or substitution by another amino 
acid residue; and 

10 (c) a protein encoded by DNA that hybridizes with DNA having 

a nucleotide sequence selected from the group consisting of SEQ ID 
NO: 1, 2, 3, 24, 25, 26, and 27. 

2 . A fusion protein comprising the protein of claim 1 and another 
peptide or polypeptide. 

15 3. A peptide comprising a part of the protein of claim 1. 

4. A DNA encoding the protein or peptide of any of claims 1 

to 3, 

5. A vector comprising the DNA of claim 4 inserted therein. 

6 . A transf ormant carrying the DNA of claim 4 in an expressible 
20 manner . 

7 . A method of producing the protein or peptide of any of claims 
1 to 3, comprising the steps of: cultivating the transf ormant of claim 
6, and recovering the protein or peptide expressed therein. 

8. A method of screening for a compound that is capable of 
25 binding to the protein of claim 1, comprising the steps of: 

(a) exposing a test sample to the protein or peptide of any of 
claims 1 to 3 , and 

(b) selecting the compound that binds to the protein or peptide 
of any of claims 1 to 3 . 

30 9. A method of screening for a ligand and/or agonist that is 

capable of binding to the protein of claim 1, comprising the steps 
of: 

(a) exposing a test sample to a cell expressing the protein or 
peptide of any of claims 1 to 3 on its surface, 
35 (b) measuring a biochemical change in said cell, and 

(c) selecting the compound that induces said biochemical change 
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in said cell. 

10. An antibody that binds to the protein of claim 1. 

11 . A method of detecting or measuring the protein or peptide 
of any of claims 1 to 3 , comprising the steps of: exposing the antibody 

5 of claim 10 to a sample which is assumed to comprise said protein 
or peptide, and detecting or measuring the generation of an immune 
complex between said antibody and said protein or peptide. 

12 . ADNAof a length of 15 nucleotides or longer that hybridizes 
with a DNA having a nucleotide sequence selected from the group 
10 consisting of SEQ ID NO: 1, 2, 3, 24, 25, 26, 27, and their complementary 
strands . 
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ABSTRACT 



Partial cDNA sequences of target genes are obtained by performing 
RT-PCR with the use of an oligonucleotide primer designed based on 

5 a human genome sequence containing novel G protein-coupled receptor 
genes identified within a database. Further, full-length cDNAs of 
these genes are successfully isolated by the 5 ' -RACE method and 3 ' -RACE 
method , with the use of a human testis-origin cDNA library as a template . 
The proteins encoded by thus isolated genes have the characteristics 

10 of known olfactory receptor (OR) gene family. Moreover, the expression 
properties of them suggest that these genes may have functions relating 
to immune response and hemopoiesis. 
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Figure 3 
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1 ATGGACAGTC TAAACCAAAC AAGAGTGACT GAATTTGTCT TCTTGGGACT 
. „ . fr» 

51 CACTGATAAC CGGGTGCTGG AAATGCTGTT TTTCATGGCA TTCTCAGCCA 

101 TTTATATGCT AACGCTTTCA GGGAACATTC TCATCATCAT TGCCACAGTC 

151 TTTACTCCAA GTCTCCATAC CCCCATGTAT TTCTTCCTGA GCAATCTGTC 

201 CTTTATTGAC ATCTGCCACT CATCTGTCAC TGTGCCTAAG ATGTTGGAGG 

251 GTTTGCTTTT AGAAAGAAAG ACCATTTCCT TTGACAACTG CATCACACAG 

301 CTCTTCTTCC TACATCTCTT TGCCTGTGCC GAGATCTTTC TGCTGATCAT 

351 TGTGGCGTAT GATCGTTACG TGGCTATCTG CACTCCACTC CACTACCCCA 

401 ATGTGATGAA CATGAGAGTC TGTATACAGC TTGTCTTTGC TCTCTGGTTG 

451 GGGGGTACTG TTCACTCACT AGGGCAGACC TTC TTGACTA TTCGTCTACC 

501 TTACTGTGGC CCCAACATTA TTGACAGCTA CTTCTGTGAT GTGCCTCTTG 

551 TTATCAAGCT GGCCTGCACA GATACATACC TCACAGGAAT ACTGATTGTG 

601 ACCAATAGTG GAACCATCTC CCTCTCCTGT TTCTTGGCCG TGGTCACCTC 

651 CTATATGGTC ATCCTGGTTT CTCTTCGAAA ACACTCAGCT GAAGGGCGOC 

701 AGAAAGCCCT GTCTACCTGC TCGGCCCACT TCATGGTGGT TGCCCTCTTC 

751 TTTGGGCCAT GTATCTTCAT CTATACTCGG CCAGACACCA GCTTCTCCAT 

801 TGACAAGGTG GTGTCTGTCT TCTACACAGT GGTCACCCCT TTGCTGAATC 

851 CCTTCATTTA CACCTTGAGG AATGAGGAGG TAAAAAGTGC CATGAAGCAG 

901 CTCAGGCAGA GACAAGTTTT TTTCACGAAA TCATATACAT AA 

M 
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101 


TCTACATCCT 


GACTCAGCTG 
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151 
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CCTAATGGCC 


TATGACAGGT 


ACCTGGCAAT 


ATGTCAGCCC 
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TGGAGCCTGG 


451 


ATGGCAGGAT 
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CCTTCCGCCT 


501 


GCCCTACTGT 
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CTTCTTCTGT 


GACATCCCTG 
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CAGTGTTGAG 
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GCTGACACAA 


CAGTCAACGA 


GCTGGTGACG 


601 
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TTGGGGTGGT 


GGTTGCCAGT 


TGCTTCTCCC 


TGATCCTCCT 


651 


CTCCTACATA 


CAGATCATTC 


AGGCCATCCT 


GAGAATCCAC 


ACAGCTGATG 


701 


GGCGGCGCCG 


GGCTTTTTCA 


ACTT GTGGAG 


CCCATGTAAC 


CGTGGTCACC 


751 


GTGTACTATG 


TGCCCTGTGC 


CTTCATCTAC 


CTGAGGCCTG 


AAACCAACAG 


801 


CCCCCTGGAT 


GGGGCAGCTG 


CCCTAGTCCC 


CACGGCCATC 


ACTCCTTTCC 


851 


TCAACCCCCT 


TATCTACACT 


CTGCGGAACC 


AAGAGGTGAA 


GCTGGCCCTG 


901 


AAAAGAATGC 


TCAGAAGCCC 



AAGAACTCCG 


AGTGAGGTTT GA 
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1 ATGGGAAAGA CCAAAAACAC ATCGCTGGAT GCCGTGGTGA CAGATTTCAT 

► 

51 TCTTCTGGGT TTGTCTCACC CCCCAAATCT AAGAAGCCTC CTCTTCCTGG 

101 TCTTCTTCAT CATTTACATC CTCACTCAGC TGGGGAACCT GCTCATTCTG 

151 CTCACCATGT GGGCTGACCC GAAGCTCTGT GCTCGCCCCA TGTACATTCT 

201 TCTGGGAGTG CTCTCATTCC TGGACATGTG GCTCTCCTCA GTCACCGTTC 

251 CTCGGCTTAT TTTGGATTTT ACTCCTTCCA TCAAGGCTAT CCCGTTTGGT 

301 GGCTGTGTGG CTCAACTGTA TTTCTTTCAC TTCCTGGGCA GCACCCAGTG 

351 CTTCCTCTAC ACCTTGATGG CCTATGACAG GTACCTAGCA ATATGTCAGC 

401 CCCTGCACTA CCCAGTGCTC ATGAATGGGA GGTTATGCAC AGTCCTTGTG 

451 GCTGGAGCTT GGGTCGCCGG CTCCATGCAT GGGTCTATCC AGGCCACCTT 

501 GACCTTCCGC CTGCCCTACT GTGGGCCCAA TCAGGTGGAT TACTTTATCT 

551 GTGACATCCG CGCAGTATTG AGACTGGCCT GTGCTGACAC AACTGTCAAT 

601 GAGCTTGTGA CCTTTGTGGA CGTCAGGGTA GTGGCCGCCA GTTGCTTCAT 

651 GTTAATTCTG CTCTCCTATG CCAACATAGT CCATGCCATC C TGAAGAT AC 

701 GCACCGCTGA TGGGAGGCGC CGGGCCTTCT CCACCTGTGG CTCCCACCTA 

751 ATCGTGGTCA CAGTCTACTA TGTCCCCTGT ATTTTCATCT AC CTTAGGGC 

801 TGGCTCCAAA GACCCCCTGG ATGGGGCAGC GGCTGTGTTT TACACTGTTG 

851 TCACTCCATT ACTGAACCCC CTCATCTATA C AC TGAGGAA CCAGGAAGTG 

901 AAGTCTGCCC TGAAGAGGAT AACAGCAGGT TGA 

^ 
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Figure 8 
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Figure 10 
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1 CTCATTGAATGGACAGTCTAAACCAAACAAGAGTGACTGAATTTGTCTTCTTGGGACTCA 

MetAspS«rIjeuAsnGlnThrArgValThrGluPh©ValPhoL©uGlyLeuThr 

6 1 CTGATAACCGGGTGCTGGAAATGCTGTTTTTGATGG 

AspAsnAr gValLeuG ItiMetLQu PhePhoMetAlaPheS QrAlalleTyrMetLeuThr 

TM-I 

12 1 CGCTTTCAGGGAACATTCTCATCATCATTGCCACAGTCTTTACTCCAAGTCTCCATACCC 

T.eufierGlyAanlleLeviTlQTleTleAlaThrValPheThrProSerLeuHisThrErQ 

,181 CCATGTATTTCTTCCTGAGCAATCTGTCCTTTATTGACATCTGCCACTCATCTGTCACTG 

HetTyrPhePhelieuSQrAsnLeuSerPhelleAspIleCysHisSerSgrValThrVal 

TM-II 

241 TGCCTAAGATGTTGGAGGGTTTGCTTTTAGAAAGAAAGACCATTTCCTTTGACAACTGCA 

PiiiLysMetLeuGluGlyLeuLeuLeuGluArgLysThrlleSerPheAspAsnCysIle 

301 TCACACAGCTCTTCTTCCTACATCTCTTTGCCTGTGCCGAGATCTTTCTGCTGATCATTG 

ThrGl nT,euPhePheIieuHisLeuPheAlaCysAlaGluTlePheT t euI,fitiIleTleVal 

TM-I 1 1 

361 TGGCGTATGATCGTTACGTGGCTATCTGCACTCCACTCCACTACCCCAATGTGATGAACA 

AlaTyrAspArgTyrValAlaTleCysThT-ProT.euHisTyrProAsnValMetAsnMet 

421 TGAGAGTCTGTATACAGCTTGTCTTTGCTCTCTGGTTGGGGGGT ACTGTTCACTCACTAG 

AraValCysIleGlnLeuValPh^ AlaT.euTrpI^uGlyGlyThrValHisSerLeuGlv 

TM-IV 

481 GGCAGACCTTCTTGACTATTCGTCTACCTTACTGTGGCCCCAACATTATTGACAGCTACT 
GlnThrPheLeuThrlleArgLeuProTyrCysGlyProAsnllelloAspSerTyrPhe 

541 TCTGTGATGTGCCTCTTGTTAT CAAGCTGGC CTGCACAGATACATACCTCACAGG AATAC 
CysAspValProLeuVallleLysLeuAlaCysThrAspThrTyrlieuThrGlyXlfiLeu 

601 TGATTGTGACCAATAGTGGAACCATCTCCCTCTCCTGTTTCTTGGCCGTGGTCACCTCCT 

IleValThrAsnSerGlyThrTleSerT.euSerCysPheLeuAlaValValThrSerTyr 

TM-V 

661 ATATGGTCATCCTGGTTTCTCTTCGAAAACACTCAGCTGAAGGGCGCCAGAAAGCCCTGT 
MfitVolIleLeuValSerLeuArgLysHisSerAlaGluGlyArgGlnLysl 



Figure 11 



1 1/3 1 



09/807132 



721 CTACCTGCTCGGCCCACTTCATGGTGGTTGCCCTCTTCTTTGGGCCATGTATCTTCATCT 

ThrCysgerAlaHisPheMetValValAlaXieuPhePheGlyProCysIleChelleXyr 

TM-VI 

781 ATACTCGGCCAGACACCAGCTTCTCCATTGACAAGGTGGTGTCTGTCTTCTACACAGTGG 

ThY-A^gPT-rtAQpThrgftrPh^RftT-Tl^^ctpTygV^lVfili^erValPheTyrThrValVal 

841 TCACCCCTTTGCTGAATCCCTTCATTTACACCTTGAGGAATGAGGAGGTAAAAAGTGCCA 

ThrProT .mit^uA snProPh^ TleTyrThrLeuArgAsnGluGluValLvsSerAlaMet 
TM-VII 

901 TGAAGCAGCTCAGGCAGAGACAAGTTTTTTTCACGAAATCATATACATAATGGGCATTGG 
Ly sGlnLeuArgGlnAr gGlnValPhePheThrLysSerTyrThr * * * 

961 GATTGCAGACATAATTGCAGCCACATCCTTAATGAAAGAGCAAAAGTAAAGAGTCAAAAT 

1021 CAACTTATATAACTTGGTAAATTAGGTAAAATGGCATAGAGCAGGTCAGATTTCTGCTCA 

1081 TT AAAG ATAAG AACTT AT T C TGT T CAT TAAAGATAAGAACT TAT TAAC T AT T ATT T AAAT 



1141 
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1 ATTCTCTGGGATATGGAAAGAATCAACAGCACACTGTTGACTGCGTTTATCCTGACAGGA 
MetGluArglleAsnSerThrLeuLeuThrAlaPhelleLeuThrGly 
61 ATTCCGTATCCACTCAGGCTAAGGACACTCTTTTTTGTGTTCTTTTTTCTAATCTACATC 
I leP roTyrProLonAraLeuArgThri.eu PhQPheValPhQPhePheLeuTleTyrlle 

121 CTGACTCAGCTGGGAAACCTGCTTATTTTAATCACTGTCTGGGC AGACCCAAGGCTCCAT 
LeuThrGl nLouGl yAsnLeuT^ ullal^ui leThrVal TrpAl aAspPr oAr gLeuHi s 
TM-I 

181 GCCCGCCCCATGTACATCTTTCTTGGTGTTCTCTCAGTCATTGATATGAGCATCTCCTCC 
AlaAr gPrQMetTvrllePheLeuGlyVallieuSerValTleAspMetSerlleSerSer 

TM-I I 

2 41 ATCATTGTCCCTCGCCTCATGATGAACTTCACTTTAGGTGTCAAACCCATCCCATTTGGT 
IlelleValProArgLeuMetMetiAsnPh^Thrr^uGlyVfllT.ysProIleProPheGly 

301 GGCTGTGTTGCTCAACTCTATTTCTATCACTTCCTGGGCAGCACCCAGTGCTTCCTCTAC 
GlyCysValAlaGlaLeuTyrPheTyrHisP^eLieuGlyS^rThT-GlnCysPheLeuTyr 

TM-I II 

361 ACCCTAATGGCCTATGACAGGTACCTGGCAATATGTCAGCCCCTGCGCTACCCTGTGCTC 
Th^Leu MetiAlaTyr AspArgTyrl.euAlaT 1 e^CysGl nP roT^uArgTyrProVa iLeu 

421 ATGACTGCTAAGCTGAGCGCCTTGCTTGTGGCTGGAGCCTGGATGGCAGGATCCATCCAT 
MetThrAlaLys LeuSerAlaLeuLeuValAlaGly AlaTr pMetAlaGlySerlleHis 

TM-IV 

461 GGGGCTCTCCAGGCCATCCTAACCTTCCGCCTGCCCTACTGTGGGCCCAATCAGGTGGAT 
GJ^AlaLeuGlnAlalleLeuThrPheArgLeuProTyrCysGlyProAsnGlnValAsp 

541 TACTTCTTCTGTGACATCCCTGCAGTGTTGAGACTGGCCTGTGCTGACACAACAGTCAAC 
Ty rPhePheCy s As pi 1 ePr oAl aVa lLeuAr gLeu Al aCy s Al aAspT hrThr Va lAs n 

6 01 GAGCTGGTGACGTTTGTAGACATTGGGGTGGTGGTTGCCAGTTGCTTCTCCCTGATCCTC 
GluT.euValThrPheValAspTl^Gl yValValV alAlaSerC ysPheSerLeuIleLeu 

TM-V 

661 CTCTCCTACATACAGATCATTCAGGCCATCCTGAGAATCCACACAGCTGATGGGCGGCGC 
LeuSerT y r I le Glnl lei leGlnAlal leLeuAr gl leHisThrAlaAspGlyArgAr g 
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721 CGGGCTTTTTCAACTTGTGGAGCCCATGTAACCGTGGTCACCGTGTACTATGTGCCCTGT 
Arc yAlaPheSerThrCYaGlyAlaHisVAlThrValVaiyhr ValTyrTyrValProCys 

TM-VI 

781 GCCTTCATCTACCTGAGGCCTGAAACCAACAGCCCCCTGGATGGGGCAGCTGCCCTAGTC 
AlaPhelleTyr LeuArgProGlnThrAsn.qftrPt'oT.enAspGlyAlaAlaAlaLeuVal 

841 CCCACGGCC^TGACT CCTTTCCTCAACC^ 

PrQThrAlaTleThrPrQPheLeuAsnProT.en T 1 ^TyrThrLeuArgAsnGlnGliiVal 
TM-VII 

901 AAG CTGGC CCTG AAAAG AATGCTC AGAAGC C C AAGAAC T C CGAGT G AGGTTTG AAAGTG T 
Ly sLeuAl aLeuLy s Ar gMe t LeuAr gS er P r oAr gT hrP r o Se rG luV a 1 * * * 

961 C TTT CT CCCACT AGGGAAGCTGCCACAATT AGAATTT ATTATAA TGT T TAGG CTT C GGT A 

1021 ACTTTTTTCTTTTCTTCTTGTTTTTTCTCTTTTATATAGCCATACTGTATGATCAAACAC 

1081 AGTTTAAGGTAAAATACTAACTTTCTAACAGTTCCTTAGTATCCTCTCAAGAT AACTCTC 

1141 AGCCACTGCAAGAGTAGAGAATGAGACCAAATTCTCACAAACTAAACCACATTAAACAAT 



1201 



CCAGAAGAAAGAATGCAATAGTGTATTTTCCAATGTCTCAGTAATAAA 
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Figure 14 

1 GGCAACCTAAAAGCAAGCATGGACAGTTCCTTGGTGAATAACCAAAAACAAGATGGAGTC 
61 TCGCTCTGTTGCCCAGGCTGGAGTGTAGTGGCGCCATCTCGGCTCGCTGCGGTCTCCGCC 
12 1 TCCCGGGTTCAGGCGATTCTCCGGCCTCAGCCTCCCGGGTGCGTGGGATTGCAGGAACTA 
181 GAACTAAAGCGAGGTTAATTTCCACAGTGAGAACATGCTCCAGACATCCGAGCACCAGTG 
241 TGGCTCTGGAAACTCCACAGATACCACAGGACTAGAAAATAACTGGACAATGGGATGTTC 
30 1 TATCTTGCCCGAACTGAGGGATATAAAAAGCTCCAAAGACAAAGAAAGTACCATCCACCC 
361 ATCCCAAAAG AAATTATCCTTCCTTCTGAAAATAAGACTGCAAAAAGACATGGGAAAGAC 

MotGlyLysThr 

421 CAAAAACACATCGCTGGATGCCGTGGTGACAGATTTCATTCTTCTGGGTTTGTCTCACCC 
* LysAsnThrSerLeuAspAlaValValThrAspPhelleLouIieuGlyLeuSerHisPro 

481 CCCAAATCTAAGAAGCCTCCTCTTCCTGGTCTTCTTCATCATTTACATCCTCACTCAGCT 
ProAsnLeuArgSerLeu LeuPheLeuValPhePhel lelle T yrl leLeuThrG InLeu 

TM-I 

541 GGGGAACCTGCTCATTCTGCTCACCATGTGGGCTGACCCGAAGCTCTGTGCTCGCCCCAT 
GlyAsnLeuLeuIle LeuLeuThrMetT rpAl aAspProLysLeuCysAlaArgProMet 

601 GTACATTCTTCTGGGAGTGCTCTCATTCCTGGACATGTGGCTCTCCTCAGTCACCGTTCC 
TyrTleLe ut^uGlyValLeuSftrPh^r.^nARpMfttTrpTj^iSerSerValThrValPrQ 

TM-I I 

661 TCGGCTTATTTTGGATTTTACTCCTTCCATC AAGGCTATCCCGTTTGGTGGCTGTGTGGC 
ArgLeuIleLeuAspPheThrProSerlleLysAlalleProPheGlyGlyCysValAla 

721 TCAACTGTATTTCTTTCACTTCCTGGGCAGCACCCAGTGCTTCCTCTACACCTTGATGGC 

GlnLeuTyrPhePheHisPheLRuGlySerThrGlnCysPheLeuTyrThrLeuMetAla 

7 81 CTATGACAGGTACCTAGCAATATGTCAGCCCCTGCACTACCCAGTGCTCATGAATGGGAG 
TyrAspArgTyrLeuAlalleCysGlnProLeuHisTyrProValLeuMetAsnGlyArg 

841 GTTATGCACAGTCCTTGTGGCTGGAGCTTGGGTCGCCGGCTCCATGCATGGGTCTATCCA 
LeuCysThrValLeuVaTAlaGlyAl aTrpValAl nGlySerMetHisGl ySerlle Gln 

TM-IV 

901 GGCCACCTTGACCTTCCGCCTGCCCTACTGTGGGCCCAATCAGGTGGATTACTTTATCTG 
AlaThrLeuThrPheArgLeuProTyrCysGlyProAsnGlnValAspTyrPhelleCys 
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961 TGACATCCGCGCAGTATTGAGACTGGCCTGTGCTGACACAACTGTCAATGAGCTTGTGAC 
AspIleArgAlaValX>euArgLeuAlaCysAlaAspThrThrValAsnGluLeuValThr 

1021 CTTTGTGGACGTCAGGGTAGTGGCCGCCAGTTGCTTCATGTTAATTCTGCTCTCCTATGC 
PhftValAsp ValArg Val ValAlaAlaSerCysPhfiMfit.T.euIlelieuLeuSerTvrAla 

TM-V 

1081 G^ACATAGTCCATGCCATCCTGAAGATACGCACCGCTGATGGGAGGCGCCGGGCCTTCTC 
| AsnllQValHisAlalleLeuLysIleArgThrAlaAspGlyArgArqArgAiaPheSer 

1141 CACCTGTGGCTCCCACCTAATCGTGGTCACAGTCTACTATGTCCCCTGTATTTTCATCTA 
ThrCysGI ySerHisLeuIleValValThrValTy rTyrValPrQC ysIlePhelleT vr 

TM-V1 

1201 CCTTAGGGCTGGCTCCAAAGACCCCCTGGATGGGGCAGCGGCTGTGTTTTACACTGTTGT 
T,euArgAlaGly5erLysAspProLeuAspGly AlaAlaAlaValPheTvrThrValVal 

1261 CACTCCATTACTGAACCCCCTCATCTATACACTGAGGAACCAGGAAGTGAAGTCTGCCCT 
ThrPrpX ^uL^u Agr^r^euIlgTyrThrL^uArgAsnGlnGluValLysSerAlaLeu 

1321 GAAG AGG AT AAC AG C AGG T TG AAGG ACTG AATG AAAAT AAGTAACT ACATC TG CAT CAT T 
Ly sArgl leThrAlaGly * * * 

1381 ATCACTGCCACTCTCTTCAGCTACTGCTGCATGTGACAAATGCCCAATAAA 
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1 CAGCAGCTTGTCCTTCGTCXjA 

51 AAATCCTGGTGAACTTCCTAG^ 

101 TGCATGGTCCAGCTCTTTTTCTTTC 

151 CCTCCTGACTGCCATGGCATATGATCGCrTATGTTC 

201 TGCTTTATAATGCGATCATGTCCTC^ 

251 GCIX^CTTCTTCTTCGGC^ 

351 ATGTTCTTCCCCTCCTCAATCTCTC 

401 CTTCTACTTTTTATCATTC 
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1 CATGGTAGGCAACCTTGGCTTC 

5 1 TCCACACACCAATCTACTATTTCCTCTTCAATC 

101 TGTOACTCCTCTGTTTTCACTCCCAAAATGC 

151 AAAGAATATTATCTCCAATGTTGGGTC 

201 TCTTTTTCGTCATCTCTGAATGTTACATG 

251 CGCTATXnx^CCATCTGTAATCCA! 

301 TCAGGTCTGTTCTATGCTCACTTTTGC 

351 GAGCCACGGCCCACACCXX^TGCATGTCT^ 

401 AATATCATTAACCATTACTTGTGTGACATACTCCCCCTCCTC 

451 CTGCACCAGCACCTATGTCAACGAGGTGGTTGTTCTC 

501 CTAATATCACGGTACCCAGTTGTACCATC 

551 GTCACTAGCATTCTTCATATCAAATCCACTC AAGGAAGATCAAAAGCCTT 

601 CAGTACTT^TAGCTCTCATCTCATTC 
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1 CATGGTAGGCA&CCTTGGCTTGATC 

51 TCCACACACCAATGTACTATTTCCTCTTCAATCT^ 

101 TGTTACTCCTCTGTTTTCACTCC^ 

151 AAAGAATATTATCTCCTATGTOGGGTGCATGACTCAGCTC 

201 TCTTTTTTGTCATCTCTGAATCCTAC 

251 OTCTATGTX^CCATCTGTAATCCATTGCTGT^ 

301 TCAGGTCTGTTCTATGCTCACT^ 

351 GAGCCACGGCCCACACCGGGTGCATGC 

» 401 AATATCATCAACCATTACTTOT 

451 CTGCACCAGCACCTATGTCAACGAGGTGGTO 

501 TTAATATCATGGTACC CAGTTGTACC ATCCTC ATTTCTTATGTTTTC 

551 GTCACTAGCATTCTTC ATATCAAATCC ACTC AAGG AAGATCAAAAGCCTT 

601 CAGTACTTGTAGCTCTCATGTCATTC 
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1 CCTTTATAGATCTCTCTTATTCCTC 

51 GACTTTGTTTCAGAAAGTATCATCTC 

101 ATTTTTCTTCTGTTTCT^ 

151 TGGCCTATGATCGCTATGTGGCCATCTGCAACCCCCTG 

201 ACCATGTCCCCAAGGGTCTGCTTTC 

251 AGGGTTTGCTGGGGCCATGGCCCACACTGGAAGCATGCTC 

3 01 TCTGTGATTCCAACGTCATTGACCATTATCTGTGTC 

3 51 TTGCAGCTCTCCTGCACCAGCACCCATC 

' 401 TGTTGTTGGAGTAATC ACC ATGCTATC C AGO ATAAGC ATCGTCATCTCTT 

451 ACGCTTTGATACTCTCCAACATC 

501 TCCAAAGCC 
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1 AAATGCCTAAAGAAGAATGACCATGGAAAATTATTCTATGGCAGCTCAGTTTGTCTTAGA 

MetThrMotGluAsnTyrSerMetAlaAlaGlnPheValLeuAsp 

61 TGGTTTAACACAGCAAGCAGAGCTCCAGCTGCCCCTCTTCCTCCTGTTCCTGGGAATCTA 
GlyT.fliiThirfll rtfll n&l afZl nT.^iinl nT.ftiiPT-oT r q T iP hftI.anT.ttU PhaLftiifil yT 1 t*Tyr 

121 TGTGGTCACAGTAGTGGGCAACCTGGGCATGATTCTCCTGATTGCAGTCAGCCCTCTACT 
ValVfllThrValValGlyARnT.miftlyMft^^ 
TM-I 

181 TCACACCCCCATGTACTATTTCCTCAGCAGCTTGTCCTTCGTCGATTTCTGCTATTCCTC 
His ThrPmMfl^TyT-Ty rPhQT J e^S«rflftr^fil1SflrPhaVfl^ AspPhaCysTyrSerSer 

TM-I I 

241 TGTCATTACTCCCAAAATGCTGGTGAACTTCCTAGGAAAGAAGAATACAATCCTTTACTC 
Val Tl ftThT-PT-f^T.ynMftt^T.^iiV^I aanPhflT^uGl yT.y fiLy BAnnThr I ] eLeuTyrSnr 



301 



361 



TGAGTGCATGGTCCAGCTCTTTTTCTTTGTGGTCTTTGTGGTGGCTGAGGGTTACCTCCT 
GluCysMetValGl nT.flUPhftPhePheVA 1 V« l PhftVa iVa 1 A T aGl uGl yTyrT.fiiit.eu 

TM-III 

G AC TGC C ATGGCATATGATCGCTATGTTGCC ATC T GTAGCC C AC TGC TTTAT AATGCGAT 
ThrAl aMehAl afyrt^argTyrU/i 1 Al wTl PPysSarProLeuLeuTyrAfinAl alle 



421 CATGTCCTCATGGGTCTGCTCACTGCTAGTGCTGGCTGCCTTCTTCTTGGGCTTTCTCTC 
MetSerSerTrpValr yflRfirLeuLflul/alr.enAl aAlaPhflPheLftiiG) yPhaLfliiSfir 

TM-IV 

481 TGC C TTG AC TC ATAC AAGTGC C ATG ATG AAAC TGT C CTTTTGC AAATC C C AC ATT AT C AA 

AlaXjeuThrHisThrSerAlaMetMetLysLeuSerPheCysLysSerHi3lleIleAsn 

541 CCATTACTTCTGTGATGTTCTTCCCCTCCTCAATCTCTCCTGCTCCAACACACACCTCAA 
HisTyrPheCysAspValLeuproLeuLeuAsnLeuSerCysSerAsnThrHlsLouAsn 

601 TGAGC TT C T AC T TTTT ATC AT TG C GGGGTTT AAC AC CT TGGTGCC C AC CC T AGCTG TTG C 

^lu r.euLQuT,^»PhftTlQT)eA lflG1 yPh«A3nThrlifiiiValPrQThrLeuAl aValA la 

TM-V 

661 TGTCTCCTATGCCTTCATCCTCTACAGCATCCTTCACATCCGCTCCTCAGAGGGCCGGTC 
ya 1 SerTyrftl a^PKfiT ^ ftT.aiiTyrK^ r-T 1 ALauHial 1 aArgS^rSorGluGlyArgSeir 

721 CAAAGCTTTTGGAACATGC AGCTCTCATCTCATGGCTGTGGTGATCTTCTTTGGGTCCAT 

LysAlflPheGlyThrCys.Snrfif^rH.i nT ift »MatAl aValVal IlePheFheGlySarlla 

TM-VI 

781 TACCTTCATGTATTTCAAGCCCCCTTCAAGTAACTCCCTGGACCAGGAGAAGGTGTCCTC 
ThrPhfi MetTyrPhaLy.gPrnPrnSflr.qflrARnSerLeuAspGlnGluLysValSerSer 

841 TGTGTTCTACACCACGGTGATCCCCATGCTGAACCCTTTAATATACAGTCTGTAATCACA 
ValPheTyrThrThrVal TlePr nMet-T^n AsnProLeuTleTyrSerLe U*** 
TM-VI X 

901 GCACTTTGGAAGGCTGAGGCAGGGTTGCTTGAGTCCAGTTTGAGACCATCCTGGGGAAC A 

961 TAGTGCGATCTTGTTTCTTTCCACTGCCTAAAAACTTCAATGCTCAATTTTACTTGCAAT 



1021 



TTCCTCTTCCTGACATGGAGAATGTTGGCTTGGAATGTTC 
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Figure 25 

1 ATTTTTGAAGACAAAAAATGCTGGCTAGAAACAACTC 

MstLeuAl aAr gAsnAs nS erLeuVa lThrGluPhel laLeuAl a 

61 CTGGATTAAGAGATCXSTCCAGAGTT^ 

m yT^»ThrfcflpAr ? Prnn1 nPh^rpnl ^P™pftftPhftPHflT.ftMPhftTrfMJVal Tl f*TY r 

121 ACATTGTCACCATGGTAGGCAACCTTGGCTTGAT^ 

T 1 ftVn 1 Thi-Mii Wn 1 tttvAfinT.ftiiG! yT,«ti T 1 «ThrT.fluPhaGlvr,QuAflnSerHiaL^u 
TH-I 

181 TCCACACACCAATGTACTATTT<XrrCTTC^ 

Hi^hrPr^QtTvY-^rPh^T^nPhAAgnT.ftnfi^rPhRT 1 MAgpT.ftuCyHTyT-SfirSftr 

TH-II 

241 C TGTTTTCACTCCCAAAATGCTAATGAACT^ 

Va 1 PheTThrPrQT. y aMatT^TiMftl- AanPh^Va 1 SarT t ysT.y sAanT lei leSer AanVal 

» 

301 TTGGGTGCATGACTCGGCTGTTTTTCrr^^ 

GlvCvsMetThrAraLeuPhftPhAPh^T^nPhfiPhflValT 1 oS6rGluCvsTvrMftt-_I,ftu 

TM-III 

361 TGACCTCAATGGCATATGATCGCTATGTGGCCATCTGTA^ 

ThrS&rMetAlaTyrA spArgTyrVa 1 A 1 al leCy aAanProLQUl^euTyrliysValThr 

421 C CATGTC CCATCAjGGTCTGTTCTATGCTCAC 

MeHSerHisGlnVaT CvflR^t-W^1-T t 6nThrPhfiAl nAlaTyrT 1 aMafcGl vT^auAl «G1 y 

TM-IV 

481 GAGCCACGGCCCACACCGGGTGCATGTTT^ 

AT aThrAlHH i^hrGlyPyftMftl-PhfiAT-fjT.ftTiThi-PhQTygSerAl aAanTloTI ftAsri 

541 ACCATTACTTGTGTG&CATACTCCC^ 

HisTyrliOuCyaAspIleLeuProIiexiIiGuGlnLeuSerCysThrSerThrTyrValA^ 

601 ACGAGGTGGTTGTTCTCATTGTTGTGGGTACTAATAT 

GluVa 1 Va 1 Va 1 f.ftn T 1ft Va 1 ValGl yThi-AanTl flThrVa iProSerCvflThjrT leT^u 

TM-V 

661 TCATTTCITPATGTTTTCATTGTCACTAGC^ 

T lflSer^TyrVal PhftTl ftVal ThrSarTl ftT^uHi gTlQT.yflSarThrGlnGlyArgSer 

721 CAAAAGCCTTCAGTACTTGTAGCTCTCA 

T.vsAl aPhASftrThrTyaS^T-^ftrHisValTl rAI aT.etiSftrl.euPhePheGl ySarAl a 

TH-VI 

781 CGGCATTCATGTATATTAAATATTCTTCTCGATC 

ftlflPhQMfit.TyrllQLysTyrSQrSerGlySerMetGluGlnGlyLysVfllPhft^ 

841 TTTTCTACACTAATGTGGTGCCCATGCTCA^ 

PheTvrThrAsnVal Va I ProM^tT.fttiAfmPT-nr.fln T 1 eTyrSerLQUArQAanLvsAsp 

m-vii 

901 ATGTC^AAGTTGCACTGAGGAAAGCTCT 

ValLysValAlaLeuArgLysAlaLeuI laLysIleGlnAxgArgAsnllePhe* * * 

961 TAGAAGCAGTAATGATGTAAAAC^^TTGAAGGACTTCAAATCT 
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GAAGAGATTTTGTTGTTTCTACAGATGGTGTTATGTGT^TTTAATAAA 
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i ATTTTTGAAGACAAAAAATGCTGGCTAGAAACAACTC CTT AGT TTG 

3tDeuAlaArgAsnAsnSerLeuValThrGluPheIleLeuAla 
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Met 



CTGGATTAACAGATCGTCCJtf^^ 
GlyLeuThrAspArgProGluPheArgGlnProTiftnFhftPhflLeVlFhfiTifiUVfllllem 

ACATTGTCACCATGGTAQGCAACCTTGGCTTGATCATTCM 

t 1 ftVfl 1 ThrMe^v« 1 C51 vARTiT .^nrri y t.aiiT 1 laLauPheGlvLeuAsnSerHisLeu 
TH-I 

TCCACACACCAATGTACTATTTCCTCTTCAATCTCTC 

H-i g Thr Prf^fttTYT^r PhftT. ft iiPh6A R hT. f >tifiQrPhftTlpAimT,ffl 

TM-II 

C TGTTTTCACTCC CAAAATGCT AATGAACTTTCT 
VmlPhftThrProL vsMetLeuMetAsnPheValgerLysLysAsnl lei leSerTyrVal 

301 TTGGGTGCATGACTCAGCTGTTTTTCTTTCT 

TM-II I 

361 IX^CCTCAATGGCATATGATCGCTATGTGGCCATCTGTAATC 

X h r fiftrMft^la TvrAsi^aTvrValAlaIleCyaAsnProLeuLeuTyr^ 

CCATGTCCCATCAGGTCTGTTCTATGCT 

M^^Hi ^l nVi>1CvsSer M W l-T,«iThrPhnAlaAlaTVrTleHetfilYT i flUAl flGlY 

TM-IV 

GAGC CACGGC CCACACCGGGTGCATGCTTAGACTCAC CTTCTGCAGTGCTAATATCATCA 
ftl flThrMnH i sThr Gly Cy sMetLeuAr gLeuThrPheCy s Ser Al aAsnl le I le Asn 

ACGATTACTTGTGTGACATACTCCCCCTCCT 

HisTyrLeuCysAspIleLeuProLexiLeuGlnLeuSerCysThrSerThrTyrValAsn 

601 ACGAGGTGGTTGTTCT CATTGTTGTGGGTATTAATAT CATGGTACCCAGTTGTAC CATCC 

TK-V 

661 TCATTTCTTATGTTTTCATTGTCACTAGCA 

T 1 flfiBrTvrVa l Phe I le Va iThr S er I leLeuHisI leLy sSerThrGlnGlyArgSer 

721 CAAAAGCCTTCAGTACTTGTAGCTCTCATGT 

T.y ^l a PhRSftrThrCV3S«T-fini-Hi aV wl Tl fiAlal^tiSerLeuPhflPhftGlvSecAlfl 

TM-VI 

781 CGGCATTCATGTATATTAAATATTCTTCTGGATCTATGGAGCAGGGAAAAG 

fti ^Ph^M^tTvr ll^LvsTvrSerSerGlvSerMetGluGlnGIyliysVfll fiflrSfirVal 

TTTTCTACACTAATGTGGTGCCCATGCTCAATCCTC 

PhfiTyrThrAsnVal Val PrnMfttJ.ei iftpnP^nT.eiiTl eTvrSarLeuArqAsnLyaAsp 

TM-VI I 

ATGTCAAAGTTGC ACTGAGGAAAGCTCTGATTAAAATTCAGAG AAGAAATATAT TCT AAT 
ValLy sValAlaLeuArgLy sAlaLeuI leLy si leGlnArgArgAsnl lePhe * ** 

9 6 1 TAGAAGCAGTAATAATGTAAAACX5ATTGAAGAACTTTAAATTTTTATTAGTGTC 
1021 G AAGAGATTTTGTTGTTTCTAC AGATGGTGTTATGTGTGATTT AATAAA 
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Figure 27 



1 ACAGCTCGCCAAGATAGAATG^^ 

MetThrLeuArgAsnSerSerSerValThrGluPhelleLeu 

6 1 GTGGGATTATCAGAAC AGC CAGAGCTCC AGC TCCCTCTTTTC CTTCTATTCTTAGGG ATC 
ValGlyl^uSerCluGlnProGluLeuGlnLeuProI^nPhftTiftUliftnPhRTifiUGlVlle 

121 TATGTGTTCACTGTGGTGGGGAACTTC 

T yrVn 1 PheThrVn 1 Va 1G1 yB.nnT.AnGl vTrfmT 1 fVThrT.fflll IfiGlyl leAsnProSer 
TH-I 

181 CTTCACACCCCCATGTACTTTTTCCTC 

LeuHi BThi *Pr-nMftt.TyrPhePhQ T .«»PheAanLpnSflrPheI 3 rft nnl i euCVaTYrSsr 

TH-II 

241 TGTGTGTTTACCCCCAAAATGCTGAATGACTTTC 

ry nv^ i PhPiThrProI. vsMetLeuAsnAspPheValSerGluSer I lei leSerTyrVal 

GGATGTATGACTCAGCTATTTTTCTTCTG 



301 



481 



541 



661 



TM-III 



361 GTATCAATGGCCTATGATCGCTATGTGGCCATCTGOUVCCCCCTGCTCT^ 

\7^1 .qftrMfttiAl aTvrA spAraTvrValAlalleCysAsnProLeuLeuTyrMetValThr 

421 ATGTCCCCAAGGGTCTGCTTTCTGCTGATGTTTGGTTCCTATG^ 
M^<^rP™Ar gV a lCy S PheI .»nT.^ 

TH-IV 

GCCATGG CCC ACACTGGAAGCATGC TGC GACTGAC CTTCTGTG ATTC C AAC GT C ATTGAC 
Al aMfitAT a HisThrGlvSerMetl^euArgLeuThrPheCysAspSerAsnVallleAsp 

CATTATCTGTGTGACGTTCTCCCCCTCTTGCAGCTCTCCTGCACCAGCACCCATGTCAGT 
HisTyrLeuCysAspValLeuProLeoLeuGlnLeuSerCysThrSerThrHisValSer 

601 GAGCTGGTATTTTTCATTGTTGTTGGAGTAATCACC^ 

ttl nT.miV a 1 PhePhftT 1 eVal V fl 1G1 yVal Tl ^ThrMetLeilSfirSer ll fiffnr T 1 QVfl l 



TM-V 



ATCTCTTACGCTTTGATACTCTCCAACATCCTCT 

pfi^rTyrAI aL euIleLeuSerAsnllelieuCvsIleProSerAlaGluGlyArgSer 

721 AAAGCCTTTAGCACATGGGGCTCCCACATAATTGCTGTTGCTCTG 

T-y Q t 1 APh^^rThrTr^GlvS e rH ^T^Tl^AlflVfll AlaLfiMPhaPheG] VSflrt i lY 

TM-VI 

781 ACATTCACCTACTTAACAACATCTTTTCCTGGCTCTATGAACCATGGCA 

^hT-PheThrTvr LeuThrThrSerPhQProGlySerMetAsnHisGlyArgFhQAlaSQr 

841 GTCTTTTACACCAATGTGGTTCCCATGCTTAACCCTTCGATCTACAGTTTC 
V ^PheT yrThrAsnValValP r "M ft t^ 
TM-VI I 

90 1 GATCATAAACTTGCCCTGGGCAAAACCCTGAAGA 

As pAspLy sLeuAlaLeuG lyLy sThr LeuLy sAr g Va lLeuP he * * * 



961 



TATCACTGGCAACCGA 



Figure 28 
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OLFl HEF-fD-RBYT -LVTEFIM*<5 FPTRPEJjQIV WLMFfcTIOfA ISLICJtflGLM RiPPtffcQ 

OLF2 M 0— gQS S-TPGFUJ+Q FSEHPGLGRT LFVDVITSYL LTLVGKTLII LL. ALDTKI^ 

OLF3 MG-TD— HOT -WVSEFIL&G LSSDWD2RVS L^VLFLVMYV VTVLO^CLIV LL- RLDSEliS 

11_1 h — TME-fiYS M-AAQFVLDG LTQQAELQLP LFLLFLGIYV VTWGKLGMI LI AVSPIXH 
+ * * * * ** * ***** 

OLFl TPHYFFLSKL SFVDLCYFSD LVPKMLVNFL SENK3ISYYG fcALQFYFFCT FADTESFII^A 

OLF2 BimrrhSWU STODLCfrTTS CW<»l£JVNLW GPKKTISFLD fiSVQlFIFLS LGTTECILMK 

OLF3 TPHYFFLTtfL SfcVDVSYATS WPQIXAHFL AEHKAIPFQS CAAQLTFSLA LGGISFVt&A 

11-1 TPMYYFLSSL SFVDFCYSSV ITPKM^VNFL GKKHTXLYSE fiMVQLFFFW FWAKGYUT 

*** ****** ** * ** * 

OLFl AMAYfiRXVAI CHPLLYTVVM SRGICMRLIV LSYLGGNMSS LVHTSFAFIL KYCDKNVXKH 

OLF2 VMAFBHIVAV CO?X*BYATII HPRLCWQLAS VAWVIGLVGS WQtPSTLHL PFfiPDRQVDD 

OLF3 VMAYftEXVAV CDALRYSAIM HGGL CARINA I TSWVSGFISS FVQTAITFQt* PMfiRNKFIDH 

11-1 AMAYfiBXVAI CSPLLYNAIM SSWVCSLLVL AAFFLGFLSA LTHTSAMMKL SFfiKSHIINH 



** ***** 



OLFl FFjCDLPP&LK LSfiTOTTIHB WLLSTYGSSV EIICFIIIII SYFFILLSVL KIRSFSGRKK 

OLF2 FVCEVPAXtlR LS£Ep*F3Yjtl? IQVAVASVFI LWPLSLILV 5YGAITWAVI, RJMSATAWRK 

OLF3 ISfiELLAWR LAfiVDTSSNE VTIMVSSIVL LMTPLCLVLL SYIQIISTIL KIQSREGRKK 

11-1 YFfcDVLPLLN LS£S*fTHLNE LLLFIIAGFN TLVPTLAVAV 6YAFILYSIL H*R0SEGRSK 

* ***** ******* 

OLFl TFSTCASHS/F SVTIYQGTLL FlYSRpSYLY SPNTDKIISV FYTIFIPVLN PLXVSIilQIKD 

OLF2 AFCTCSSHLT WTLFYSSVI AVYLQPKNPY AQGRdKFFGL FYAVGTPSLK PLVYT£RNKE 

OLF3 AJTHTCASHLT WALCYGVAI FTYIQPHSSP SVLQEKLFSV FYAILTPMLH PKXYSI^NKE 

11-1 AFG£CSSB£M AWIFFGSIT FMVFKPPSSN SLDQEKVSSV FYTTVIPM&N Vhl*$t>~ 

* ** *** * * * * ******** 

OLFl VKDAAEKVLR SKVDS — S 

OLF2 IKRALRRLLG KERDSRESWR AA 

OLF3 VKGAWQKI^W KFSG-LTSKL AT 

11-1 



Figure 29 
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0LF2 
0LF3 
11-2 

0LF2 
0LF3 
11-2 

0LF2 
0LF3 
11-2 

0LF2 
OLF3 
11-2 

OLF2 
OLF3 
11-2 

OLF2 
0LF3 
11-2 



M— DKQ 
MGT-DKQ 
MLARW 
* * 
SPMYFFLSNL 
TPMYFFLTNL 
TPMYYFLFNL 
*** ** ** 
VMAFDRYVAV 
VMAYDRYVAV 

SHAYDRYVA1 
** ***** 

FVCEVPALIR 
ISCELLAWR 
YLCDILPLLQ 
* 

AFGTCSSHLT 
AFHTCASHLT 
AFSTCSSHVI 
** ** ** 

IKRALRRLLG 
VKGAWQKLLW 
VKVALRKALI 
* * * 



SSTPGFLLLG 
TWVSEFILLG 
SLVTEFILAG 
* * * 
SFLDLCFTTS 
SLVDVSYATS 
SFIDLCYSSV 

* * 

CQPLHYATII 
CDALRYSAIM 
CNPLLYKVTM 

* * * 
LSCEDTSYNE 
LACVDTSSNE 
LSCTSTYVNE 
***** 
WTLFYSSVI 
WALCYGVAI 
ALSLFFGSAA 

* 

KERDSRESWR 
KFSGLTSKL 
KIQ-HRH-I 
* 



FSEHPGLGRT 
LSSDWDTRVS 
LTDRPEFWQP 

CVPQMLANLW 
WPQLLAHFL 
FTPKHLHHFV 

* * 
HPRLCWQLAS 
HGGLCARLAI 
SHQVCSMLTF 

* * 
IQVAVASVFI 
VTIMVSSIVL 
VWLIWGTN 

AVYLQPKNPY 
FTYIQPHSSP 
FMYIKYSSG 
* 

AA 
AT 

-F 



LFVDVITSYL 
LFVLFLVMYV 
FFFLFLVIYI 
* * 
GPKKTISFLD 
AEHKAIPFQS 
SKKNIISNVG 
* 

VAWVIGLVGS 
TSWVSGFISS 
AAYIMGLAGA 
* 

LVVPLSL1LV 
LMTPLCLVLL 
ITVPSCTILI 
* * 
AQGRGKFFGL 
SVLQEKLFSV 
SMEQGKVFSV 
* * 



LTLVGNTLI1 
VTVLGNCLIV 
VTMVGNLGLI 
* ** 
CSVQIFIFLS 
CAAQLFFSLA 
CKTRLFFFLF 
* * * 
WQTPSTLHL 
PVQTAITFQL 
TAHTGCMFRL 
* 

SYGAITWAVL 
SYIQIISTIL 
SYVFIVTS1L 
** * * 
FYAVGTPSLN 
FYAILTPMLN 
FYTNWPMLN 



** 



* ** 



LLSALDTKLH 
LLIRLDSRLH 
TLFGLNSHLH 
* ** 
LGTTECILMK 
LGGIEFVLLA 
FVISECYHLT 
* 

PFCPDRQVDD 
PMCRNKFIDH 
TFCSAN1INH 
* 

RINSATAWRK 
KIQSREGRKK 
HIKSTQGRSK 
* * 
PLVYTLRME 
PMIYSLRNKE 
PLIYSLRNKD 
* # **** 



Figure 3 0 
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0LF2 M — DNQ SSTPGFLLLG FSEHPGLGRT LFVDVITSYL LTLVGNTLI I LLSALDTKLH 
0LF3 MGT-DNQ TWVSEFILLG LSSDWDTRVS LFVLFLVHYV VTVLGNCLIV LLIRLDSELH 
11-3 MLAR-p SLVTEFILAG LTDRPEPRQP LFFLFLVIYI VTHVGKLGLI ILFGLNSHLH 

* * * * * ** ****** 

0LF2 SPMYFFLSNL SFLDLCFTTS CVPQMLANLW GPKKTISFLD CSVQIFIFLS LGTTECILMK 
0LF3 TPHYFFLTNL SLVDVSYATS WPQLLAHFL AEHKAIPFQS CAAQLFFSLA LGGIEFVLLA 
11-3 TPMYYFLFNL SFIDLCYSSV FTPKMLMNFV SKKNIISYVG CHTQLFFFLF FVISECYILT 
*** ****** ** * **** * 

0LF2 VMAFDRYVAV CQPLHYATI I HPRLCWQLAS VAWVIGLVGS WQTPSTLHL PFCPDRQVDD 
0LF3 VHAYDRYVAV CDALRYSAIM HGGLCARLAI TSWVSGFISS PVQTAITFQL PMCRNKFIDH 
11-3 SMAYDRYVAI CNPLLYKVTM SHQVCSMLTF AAYIHGLAGA TAHTGCHLRL TFCSANIINH 
** ***** *** ** * ** 

0LF2 FVCEVPALIR LSCEDTSYNE IQVAVASVFI LWPLSLILV SYGAITWAVL RINSATAWRK 
0LF3 ISCELLAWR LACVDTSSNE VTIHVSSIVL LHTPLCLVLL SYIQ1ISTIL KIQSREGRKK 
11-3 YLCDILPLLQ LSCTSTYVNE WVLIWGIN IMVPSCTILI SYVFIVTSIL HIKSTQGRSK 
* ***** * ** *** 

0LF2 AFGTCSSHLT WTLFYSSVI AVYLftPKNPY AQGRGKFFGL FYAVGTPSLN PLVYTLRNKE 
0LF3 AFHTCASHLT WALCYGVAI FTYIQPHSSP SVLQEKLFSV FYAILTPMLN PHIYSLRNKE 
11-3 AFSTCSSHVI ALSLFFGSAA FHYIKY-SSG SMEQGKVSSV FYTNWPMLN PLIYSLRNKD 
****** * * * ******* **** 

0LF2 IKRALRRLLG KERDSRESWH AA 
0LF3. VKGAWQKLLW KFSGL-TSKL AT 
11-3 VKVALRKALI KIQ-RRN— I -F 
* * * * 



Figure 31 
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0LF2 M — DNQ SSTPGPLLLG FSEHPGLGRT LFVDVITSYL LTLVGNTLII LLSALDTKLH 
0LF3 MGT-DNQ TWVSEFILLG LSSDWDTRVS LFVLFLVHYV VTVLGNCLIV LLIRLDSRLH 
11-4 MTLR-KS SSVTEFILVG LSEQPELQLP LFLLFLGIYV FTWGNLGLI TLIGINPSLH 
* * * * * * ** * * ** * ** 

0LF2 SPMYFFLSNL SFLDLCFTTS CVPQMLANLW GPKKTISFLD CSVQIFIFLS LGTTECILMK 
0LF3 TPMYFFLTNL SLVDVSYATS WPQLLAHFL AEHKAIPFQS CAAQLFPSLA LGGIEFVLLA 
11-4 TPMYFFLFNL SFIDLCYSCV FTPKMLHDFV SES-IISYVG CKTQLFFFCF FVKSECYVLV 

****** ** * * * * * * * * * 

0LF2 VMAFDRYVAV CQPLHYATII HPRLCWQLAS VAWV1GLVGS WQTPSTLHL PFCPDRQVDD 
0LF3 VHAYDRYVAV CDALRYSAIM HGGLCARLAI TSWVSGFISS PVQTAITFQL PMCRNKFIDH 
11-4 SHAYDRYVAI CNPLLYMVTH SPRVCFLLMF GSYWGFAGA MAHTGSMLRL TFCDSNVIDH 

** ***** *** ** ** * ** * 

0LF2 FVCEVPALIR LSCEDTSYNE IQVAVASVFI LWPLSLILV SYGAITWAVL RINSATAWRK 
0LF3 ISCELLAWR LACVDTSSNE VTIMVSSIVL LMTPLCLVLL SYIQIISTIL KIQSREGRKK 
11-4 YLCDVLPLLQ LSCTSTHVSE LVFFIWGVI TMLSSISIVI SYALILSNIL CIPSAEGRSK 
* * * * * ** * * * 

0LF2 AFGTCSSHLT WTLFYSSVI AVYLQPKNPY AQGRGKFFGL FYAVGTPSLN PLVYTLRNKE 
0LF3 AFHTCASHLT WALCYGVAI FTYIQPHSSP SVLQEKLFSV FYAILTPMLN PHIYSLRNKE 
11-4 AFSTWGSHII AVALFFGSGT FTYLTTSFPG SMNHGRFASV FYTNWPMLN PSIYSLRNKD 
******* * ******* **** 

0LF2 IKRALRRLLG KERDSRESWR AA 
0LF3 VKGAWQKLLW KFSGL-TSKL AT 

11-4 DKLALGKTL- K R— VL -F 

* * * * 
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<141> 
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<150> JP 1998-363537 
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<160> 53 
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<210> 1 
<211> 1143 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (9) . . (947) 

<400> 1 

ctcattga atg gac agt eta aac caa aca aga gtg act gaa ttt gtc ttc 50 
Met Asp Ser Leu Asn Gin Thr Arg Val Thr Glu Phe Val Phe 
1 5 10 

ttg gga etc act gat aac egg gtg ctg gaa atg ctg ttt ttc atg gca 98 
Leu Gly Leu Thr Asp Asn Arg Val Leu Glu Met Leu Phe Phe Met Ala 
15 20 25 30 

ttc tea gec att tat atg eta acg ctt tea ggg aac att etc ate ate 146 
Phe Ser Ala lie Tyr Met Leu Thr Leu Ser Gly Asn He Leu He He 

35 40 45 

att gec aca gtc ttt act cca agt etc cat ace ccc atg tat ttc ttc 194 
He Ala Thr Val Phe Thr Pro Ser Leu His Thr Pro Met Tyr Phe Phe 

50 55 60 

ctg age aat ctg tec ttt att gac ate tgc cac tea tct gtc act gtg 242 
Leu Ser Asn Leu Ser Phe He Asp He Cys His Ser Ser Val Thr Val 

65 70 75 

cct aag atg ttg gag ggt ttg ctt tta gaa aga aag acc att tec ttt 290 
Pro Lys Met Leu Glu Gly Leu Leu Leu Glu Arg Lys Thr He Ser Phe 
80 85 90 
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gac aac tgc ate aca cag etc ttc ttc eta cat etc ttt gec tgt gee 338 
Asp Asn Cys lie Thr Gin Leu Phe Phe Leu His Leu Phe Ala Cys Ala 
95 100 105 110 

gag ate ttt ctg ctg ate att gtg gcg tat gat cgt tac gtg get ate 386 
Glu He Phe Leu Leu lie He Val Ala Tyr Asp Arg Tyr Val Ala He 

115 120 125 

tgc act cca etc cac tac ccc aat gtg atg aac atg aga gtc tgt ata 434 
Cys Thr Pro Leu His Tyr Pro Asn Val Met Asn Met Arg Val Cys He 

130 135 140 

cag ctt gtc ttt get etc tgg ttg ggg ggt act gtt cac tea eta ggg 482 
Gin Leu Val Phe Ala Leu Trp Leu Gly Gly Thr Val His Ser Leu Gly 

145 150 155 

cag acc ttc ttg act att cgt eta cct tac tgt ggc ccc aac att att 530 
Gin Thr Phe Leu Thr lie Arg Leu Pro Tyr Cys Gly Pro Asn lie He 

160 165 170 

gac age tac ttc tgt gat gtg cct ctt gtt ate aag ctg gec tgc aca 578 
Asp Ser Tyr Phe Cys Asp Val Pro Leu Val He Lys Leu Ala Cys Thr 
175 180 185 190 

gat aca tac etc aca gga ata ctg att gtg acc aat agt gga acc ate 626 
Asp Thr Tyr Leu Thr Gly He Leu He Val Thr Asn Ser Gly Thr He 

195 200 205 

tec etc tec tgt ttc ttg gee gtg gtc acc tec tat atg gtc ate ctg 674 
Ser Leu Ser Cys Phe Leu Ala Val Val Thr Ser Tyr Met Val He Leu 

210 215 220 

gtt tct ctt cga aaa cac tea get gaa ggg cgc cag aaa gec ctg tct 722 
Val Ser Leu Arg Lys His Ser Ala Glu Gly Arg Gin Lys Ala Leu Ser 
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225 230 235 

acc tgc teg gec cac ttc atg gtg gtt gec etc ttc ttt ggg cca tgt 770 
Thr Cys Ser Ala His Phe Met Val Val Ala Leu Phe Phe Gly Pro Cys 

240 245 250 

ate ttc ate tat act egg cca gac acc age ttc tec att gac aag gtg 818 
lie Phe He Tyr Thr Arg Pro Asp Thr Ser Phe Ser He Asp Lys Val 
255 260 265 270 

gtg tct gtc ttc tac aca gtg gtc acc cct ttg ctg aat ccc ttc att 866 
Val Ser Val Phe Tyr Thr Val Val Thr Pro Leu Leu Asn Pro Phe He 

275 280 285 

tac acc ttg agg aat gag gag gta aaa agt gec atg aag cag etc agg 914 
Tyr Thr Leu Arg Asn Glu Glu Val Lys Ser Ala Met Lys Gin Leu Arg 

290 295 300 

cag aga caa gtt ttt ttc acg aaa tea tat aca taatgggcat tgggattgea 967 
Gin Arg Gin Val Phe Phe Thr Lys Ser Tyr Thr 

305 310 
gacataattg cagccacatc cttaatgaaa gagcaaaagt aaagagtcaa aatcaactta 1027 
tataacttgg taaattaggt aaaatggcat agagcaggtc agatttctgc tcattaaaga 1087 
taagaactta ttctgttcat taaagataag aacttattaa ctattattta aataaa 1143 



<210> 2 
<211> 1248 
<212> DNA 

<213> Homo sapiens 
<220> 
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<221> CDS 

<222> (13).. (951) 

<400> 2 

attctctggg at atg gaa aga ate aac age aca ctg ttg act gcg ttt ate 51 
Met Glu Arg He Asn Ser Thr Leu Leu Thr Ala Phe lie 
1 5 10 

ctg aca gga att ccg tat cca etc agg eta agg aca etc ttt ttt gtg 99 
Leu Thr Gly He Pro Tyr Pro Leu Arg Leu Arg Thr Leu Phe Phe Val 

15 20 25 

ttc ttt ttt eta ate tac ate ctg act cag ctg gga aac ctg ctt att 147 
Phe Phe Phe Leu He Tyr He Leu Thr Gin Leu Gly Asn Leu Leu He 
30 35 40 45 

tta ate act gtc tgg gca gac cca agg etc cat gec cgc ccc atg tac 195 
Leu He Thr Val Trp Ala Asp Pro Arg Leu His Ala Arg Pro Met Tyr 

50 55 60 

ate ttt ctt ggt gtt etc tea gtc att gat atg age ate tec tec ate 243 
He Phe Leu Gly Val Leu Ser Val He Asp Met Ser He Ser Ser lie 

65 70 75 

att gtc cct cgc etc atg atg aac ttc act tta ggt gtc aaa ccc ate 291 
He Val Pro Arg Leu Met Met Asn Phe Thr Leu Gly Val Lys Pro He 

80 85 90 

cca ttt ggt ggc tgt gtt get caa etc tat ttc tat cac ttc ctg ggc 339 
Pro Phe Gly Gly Cys Val Ala Gin Leu Tyr Phe Tyr His Phe Leu Gly 

95 100 105 

age acc cag tgc ttc etc tac ace eta atg gee tat gac agg tac ctg 387 
Ser Thr Gin Cys Phe Leu Tyr Thr Leu Met Ala Tyr Asp Arg Tyr Leu 
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110 

gca ata tgt cag ccc 
Ala He Cys Gin Pro 
130 

age gec ttg ctt gtg 
Ser Ala Leu Leu Val 
145 

get etc cag gee ate 
Ala Leu Gin Ala He 
160 

cag gtg gat tac ttc 
Gin Val Asp Tyr Phe 
175 

tgt get gac aca aca 
Cys Ala Asp Thr Thr 
190 

gtg gtg gtt gee agt 
Val Val Val Ala Ser 
210 

ate att cag gee ate 
He He Gin Ala He 
225 

get ttt tea act tgt 
Ala Phe Ser Thr Cys 
240 

gtg ccc tgt gec ttc 



115 120 
ctg cgc tac cct gtg etc 
Leu Arg Tyr Pro Val Leu 
135 

get gga gec tgg atg gca 
Ala Gly Ala Trp Met Ala 
150 

eta ace ttc cgc ctg ccc 
Leu Thr Phe Arg Leu Pro 
165 

ttc tgt gac ate cct gca 
Phe Cys Asp lie Pro Ala 
180 

gtc aac gag ctg gtg acg 
Val Asn Glu Leu Val Thr 
195 200 
tgc ttc tec ctg ate etc 
Cys Phe Ser Leu He Leu 
215 

ctg aga ate cac aca get 
Leu Arg He His Thr Ala 
230 

gga gec cat gta acc gtg 
Gly Ala His Val Thr Val 
245 

ate tac ctg agg cct gaa 



125 

atg act get aag ctg 435 
Met Thr Ala Lys Leu 
140 

gga tec ate cat ggg 483 
Gly Ser He His Gly 
155 

tac tgt ggg ccc aat 531 
Tyr Cys Gly Pro Asn 
170 

gtg ttg aga ctg gee 579 
Val Leu Arg Leu Ala 
185 

ttt gta gac att ggg 627 
Phe Val Asp He Gly 
205 

etc tec tac ata cag 675 
Leu Ser Tyr He Gin 
220 

gat ggg egg cgc egg 723 
Asp Gly Arg Arg Arg 
235 

gtc acc gtg tac tat 771 
Val Thr Val Tyr Tyr 
250 

acc aac age ccc ctg 819 
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Val Pro Cys Ala Phe He Tyr Leu Arg Pro Glu Thr Asn Ser Pro Leu 

255 260 265 

gat ggg gca get gec eta gtc ccc acg gee ate act cct ttc etc aac 867 
Asp Gly Ala Ala Ala Leu Val Pro Thr Ala lie Thr Pro Phe Leu Asn 
270 275 280 285 

ccc ctt ate tac act ctg egg aac caa gag gtg aag ctg gec ctg aaa 915 
Pro Leu He Tyr Thr Leu Arg Asn Gin Glu Val Lys Leu Ala Leu Lys 

290 295 300 

aga atg etc aga age cca aga act ccg agt gag gtt tgaaagtgtc 961 
Arg Met Leu Arg Ser Pro Arg Thr Pro Ser Glu Val 

305 310 
tttctcccac tagggaagct gecacaatta gaatttatta taatgtttag getteggtaa 1021 
cttttttctt ttcttcttgt tttttctctt ttatatagee atactgtatg atcaaacaca 1081 
gtttaaggta aaatactaac tttctaacag ttccttagta tcctctcaag ataactctca 1141 
gccactgcaa gagtagagaa tgagaccaaa ttctcacaaa ctaaaccaca ttaaacaatc 1201 
cagaagaaag aatgeaatag tgtattttcc aatgtctcag taataaa 1248 

<210> 3 

<211> 1431 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (410). . (1339) 
<400> 3 

ggcaacctaa aagcaagcat ggacagttcc ttggtgaata accaaaaaca agatggagtc 60 
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tcgctctgtt gcccaggctg gagtgtagtg gcgccatctc ggctcgctgc ggtctccgcc 120 
tcccgggttc aggcgattct ccggcctcag cctcccgggt gcgtgggatt gcaggaacta 180 
gaactaaagc gaggttaatt tccacagtga gaacatgctc cagacatccg agcaccagtg 240 
tggctctgga aactccacag ataccacagg actagaaaat aactggacaa tgggatgttc 300 
tatcttgccc gaactgaggg atataaaaag ctccaaagac aaagaaagta ccatccaccc 360 
atcccaaaag aaattatcct tccttctgaa aataagactg caaaaagac atg gga aag 418 

Met Gly Lys 
1 

» 

acc aaa aac aca teg ctg gat gec gtg gtg aca gat ttc att ctt ctg 466 
Thr Lys Asn Thr Ser Leu Asp Ala Val Val Thr Asp Phe He Leu Leu 

5 10 15 

ggt ttg tct cac ccc cca aat eta aga age etc etc ttc ctg gtc ttc 514 
Gly Leu Ser His Pro Pro Asn Leu Arg Ser Leu Leu Phe Leu Val Phe 
20 25 30 35 

ttc ate att tac ate etc act cag ctg ggg aac ctg etc att ctg etc 562 
Phe He He Tyr lie Leu Thr Gin Leu Gly Asn Leu Leu He Leu Leu 

40 45 50 

acc atg tgg get gac ccg aag etc tgt get cgc ccc atg tac att ctt 610 
Thr Met Trp Ala Asp Pro Lys Leu Cys Ala Arg Pro Met Tyr He Leu 

55 60 65 

ctg gga gtg etc tea ttc ctg gac atg tgg etc tec tea gtc acc gtt 658 
Leu Gly Val Leu Ser Phe Leu Asp Met Trp Leu Ser Ser Val Thr Val 

70 75 80 

cct egg ctt att ttg gat ttt act cct tec ate aag get ate ccg ttt 706 
Pro Arg Leu He Leu Asp Phe Thr Pro Ser He Lys Ala He Pro Phe 
85 90 95 
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ggt ggc tgt gtg get 
Gly Gly Cys Val Ala 
100 

cag tgc ttc etc tac 
Gin Cys Phe Leu Tyr 
120 

tgt cag ccc ctg cac 
Cys Gin Pro Leu His 
135 

gtc ctt gtg get gga 
Val Leu Val Ala Gly 
150 

cag gec acc ttg acc 
Gin Ala Thr Leu Thr 
165 

gat tac ttt ate tgt 
Asp Tyr Phe lie Cys 
180 

gac aca act gtc aat 
Asp Thr Thr Val Asn 
200 

gec gec agt tgc ttc 
Ala Ala Ser Cys Phe 
215 

cat gec ate ctg aag 
His Ala lie Leu Lys 



caa ctg tat ttc ttt cac 
Gin Leu Tyr Phe Phe His 
105 110 
acc ttg atg gec tat gac 
Thr Leu Met Ala Tyr Asp 
125 

tac cca gtg etc atg aat 
Tyr Pro Val Leu Met Asn 
140 

get tgg gtc gee ggc tec 
Ala Trp Val Ala Gly Ser 
155 

ttc cgc ctg ccc tac tgt 
Phe Arg Leu Pro Tyr Cys 
170 

gac ate cgc gca gta ttg 
Asp He Arg Ala Val Leu 
185 190 
gag ctt gtg acc ttt gtg 
Glu Leu Val Thr Phe Val 
205 

atg tta att ctg etc tec 
Met Leu lie Leu Leu Ser 
220 

ata cgc acc get gat ggg 
He Arg Thr Ala Asp Gly 



ttc ctg ggc age acc 754 
Phe Leu Gly Ser Thr 
115 

agg tac eta gca ata 802 
Arg Tyr Leu Ala lie 
130 

ggg agg tta tgc aca 850 
Gly Arg Leu Cys Thr 
145 

atg cat ggg tct ate 898 
Met His Gly Ser He 
160 

ggg ccc aat cag gtg 946 
Gly Pro Asn Gin Val 
175 

aga ctg gee tgt get 994 
Arg Leu Ala Cys Ala 
195 

gac gtc agg gta gtg 1042 
Asp Val Arg Val Val 
210 

tat gee aac ata gtc 1090 
Tyr Ala Asn He Val 
225 

agg cgc egg gee ttc 1138 
Arg Arg Arg Ala Phe 
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230 235 240 

tec acc tgt ggc tec cac eta ate gtg gtc aca gtc tac tat gtc ccc 1186 
Ser Thr Cys Gly Ser His Leu He Val Val Thr Val Tyr Tyr Val Pro 

245 250 255 

tgt att ttc ate tac ctt agg get ggc tec aaa gac ccc ctg gat ggg 1234 
Cys He Phe He Tyr Leu Arg Ala Gly Ser Lys Asp Pro Leu Asp Gly 
260 265 270 275 

gca gcg get gtg ttt tac act gtt gtc act cca tta ctg aac ccc etc 1282 
Ala Ala Ala Val Phe Tyr Thr Val Val Thr Pro Leu Leu Asn Pro Leu 

280 285 290 

ate tat aca ctg agg aac cag gaa gtg aag tct gec ctg aag agg ata 1330 
He Tyr Thr Leu Arg Asn Gin Glu Val Lys Ser Ala Leu Lys Arg He 

295 300 305 

aca gca ggt tgaaggactg aatgaaaata agtaactaca tetgeatcat 1379 
Thr Ala Gly 
310 

tatcactgcc actctcttca getactgetg catgtgacaa atgeccaata aa 1431 

<210> 4 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Asp Ser Leu Asn Gin Thr Arg Val Thr Glu Phe Val Phe Leu Gly 

15 10 15 

Leu Thr Asp Asn Arg Val Leu Glu Met Leu Phe Phe Met Ala Phe Ser 
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20 25 30 

Ala lie Tyr Met Leu Thr Leu Ser Gly Asn He Leu He He He Ala 

35 40 45 

Thr Val Phe Thr Pro Ser Leu His Thr Pro Met Tyr Phe Phe Leu Ser 

50 55 60 

Asn Leu Ser Phe He Asp He Cys His Ser Ser Val Thr Val Pro Lys 

65 70 75 80 

Met Leu Glu Gly Leu Leu Leu Glu Arg Lys Thr He Ser Phe Asp Asn 
» 

85 90 95 

Cys He Thr Gin Leu Phe Phe Leu His Leu Phe Ala Cys Ala Glu He 

100 105 110 

Phe Leu Leu He He Val Ala Tyr Asp Arg Tyr Val Ala He Cys Thr 

115 120 125 

Pro Leu His Tyr Pro Asn Val Met Asn Met Arg Val Cys He Gin Leu 

130 135 140 

Val Phe Ala Leu Trp Leu Gly Gly Thr Val His Ser Leu Gly Gin Thr 
145 150 155 160 

Phe Leu Thr He Arg Leu Pro Tyr Cys Gly Pro Asn He He Asp Ser 

165 170 175 

Tyr Phe Cys Asp Val Pro Leu Val He Lys Leu Ala Cys Thr Asp Thr 

180 185 190 

Tyr Leu Thr Gly He Leu He Val Thr Asn Ser Gly Thr He Ser Leu 

195 200 205 

Ser Cys Phe Leu Ala Val Val Thr Ser Tyr Met Val He Leu Val Ser 

210 215 220 

Leu Arg Lys His Ser Ala Glu Gly Arg Gin Lys Ala Leu Ser Thr Cys 
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225 230 235 240 

Ser Ala His Phe Met Val Val Ala Leu Phe Phe Gly Pro Cys He Phe 

245 250 255 

He Tyr Thr Arg Pro Asp Thr Ser Phe Ser He Asp Lys Val Val Ser 

260 265 270 

Val Phe Tyr Thr Val Val Thr Pro Leu Leu Asn Pro Phe He Tyr Thr 

275 280 285 

Leu Arg Asn Glu Glu Val Lys Ser Ala Met Lys Gin Leu Arg Gin Arg 

' 290 295 300 

Gin Val Phe Phe Thr Lys Ser Tyr Thr 
305 310 

<210> 5 

<211> 313 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Glu Arg He Asn Ser Thr Leu Leu Thr Ala Phe He Leu Thr Gly 

15 10 15 

He Pro Tyr Pro Leu Arg Leu Arg Thr Leu Phe Phe Val Phe Phe Phe 

20 25 30 

Leu He Tyr He Leu Thr Gin Leu Gly Asn Leu Leu He Leu He Thr 

35 40 45 

Val Trp Ala Asp Pro Arg Leu His Ala Arg Pro Met Tyr He Phe Leu 

50 55 60 

Gly Val Leu Ser Val He Asp Met Ser He Ser Ser He He Val Pro 
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65 70 75 80 

Arg Leu Met Met Asn Phe Thr Leu Gly Val Lys Pro He Pro Phe Gly 

85 90 95 

Gly Cys Val Ala Gin Leu Tyr Phe Tyr His Phe Leu Gly Ser Thr Gin 

100 105 110 

Cys Phe Leu Tyr Thr Leu Met Ala Tyr Asp Arg Tyr Leu Ala He Cys 

115 120 125 

Gin Pro Leu Arg Tyr Pro Val Leu Met Thr Ala Lys Leu Ser Ala Leu 

' 130 135 140 

Leu Val Ala Gly Ala Trp Met Ala Gly Ser He His Gly Ala Leu Gin 
145 150 155 160 

Ala He Leu Thr Phe Arg Leu Pro Tyr Cys Gly Pro Asn Gin Val Asp 

165 170 175 

Tyr Phe Phe Cys Asp He Pro Ala Val Leu Arg Leu Ala Cys Ala Asp 

180 185 190 

Thr Thr Val Asn Glu Leu Val Thr Phe Val Asp He Gly Val Val Val 

195 200 205 

Ala Ser Cys Phe Ser Leu He Leu Leu Ser Tyr He Gin He He Gin 

210 215 220 

Ala He Leu Arg He His Thr Ala Asp Gly Arg Arg Arg Ala Phe Ser 
225 230 235 240 

Thr Cys Gly Ala His Val Thr Val Val Thr Val Tyr Tyr Val Pro Cys 

245 250 255 

Ala Phe He Tyr Leu Arg Pro Glu Thr Asn Ser Pro Leu Asp Gly Ala 

260 265 270 

Ala Ala Leu Val Pro Thr Ala lie Thr Pro Phe Leu Asn Pro Leu He 
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275 280 285 

Tyr Thr Leu Arg Asn Gin Glu Val Lys Leu Ala Leu Lys Arg Met Leu 

290 295 300 

Arg Ser Pro Arg Thr Pro Ser Glu Val 
305 310 



<210> 6 
<211> 310 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Gly Lys Thr Lys Asn Thr Ser Leu Asp Ala Val Val Thr Asp Phe 

15 10 15 

lie Leu Leu Gly Leu Ser His Pro Pro Asn Leu Arg Ser Leu Leu Phe 

20 25 30 

Leu Val Phe Phe He He Tyr He Leu Thr Gin Leu Gly Asn Leu Leu 

35 40 - 45 

He Leu Leu Thr Met Trp Ala Asp Pro Lys Leu Cys Ala Arg Pro Met 

50 55 60 

Tyr He Leu Leu Gly Val Leu Ser Phe Leu Asp Met Trp Leu Ser Ser 
65 70 75 80 

Val Thr Val Pro Arg Leu He Leu Asp Phe Thr Pro Ser He Lys Ala 

85 90 95 

He Pro Phe Gly Gly Cys Val Ala Gin Leu Tyr Phe Phe His Phe Leu 

100 105 110 

Gly Ser Thr Gin Cys Phe Leu Tyr Thr Leu Met Ala Tyr Asp Arg Tyr 
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115 120 125 

Leu Ala He Cys Gin Pro Leu His Tyr Pro Val Leu Met Asn Gly Arg 

130 135 140 

Leu Cys Thr Val Leu Val Ala Gly Ala Trp Val Ala Gly Ser Met His 
145 150 155 160 

Gly Ser He Gin Ala Thr Leu Thr Phe Arg Leu Pro Tyr Cys Gly Pro 

165 1?0 175 

Asn Gin Val Asp Tyr Phe He Cys Asp He Arg Ala Val Leu Arg Leu 

180 185 190 

Ala Cys Ala Asp Thr Thr Val Asn Glu Leu Val Thr Phe Val Asp Val 

195 200 205 

Arg Val Val Ala Ala Ser Cys Phe Met Leu He Leu Leu Ser Tyr Ala 

210 215 220 

Asn He Val His Ala He Leu Lys He Arg Thr Ala Asp Gly Arg Arg 
225 230 235 240 

Arg Ala Phe Ser Thr Cys Gly Ser His Leu He Val Val Thr Val Tyr 

245 250 255 

Tyr Val Pro Cys He Phe He Tyr Leu Arg Ala Gly Ser Lys Asp Pro 

260 265 270 

Leu Asp Gly Ala Ala Ala Val Phe Tyr Thr Val Val Thr Pro Leu Leu 

275 280 285 

Asn Pro Leu He Tyr Thr Leu Arg Asn Gin Glu Val Lys Ser Ala Leu 

290 295 300 

Lys Arg He Thr Ala Gly 
305 310 
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<210> 7 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400>7 

ATGGACAGTC TAAACCAAAC AAGAGTG 27 

<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 8 

ATGGCATTCT CAGCCATTTA TATGCTA 27 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 9 

GGGAACATTC TCATCATCAT TGCCACA 27 
<210> 10 

i 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 10 

TTATGTATAT GATTTCGTGA AAAAAAC 27 

<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 11 

TACCTCCTCA TTCCTCAAGG TGTAAAT 27 

<210> 12 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 12 

GGTGACCACT GTGTAGAAGA CAGACAC 27 

<210> 13 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 13 

ATGGAAAGAA TCAACAGCAC ACTGTTG 27 



<210> 14 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 14 

TCTAATCTAC ATCCTGACTC AGCTGGG 27 

<210> 15 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 15 

TTCAAACCTC ACTCGGAGTT CTTGGGC 27 

<210> 16 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 16 

AGCTTCACCT CTTGGTTCCG CAGAGTG 27 

<210> 17 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 17 

ATGGGAAAGA CCAAAAACAC ATCGCTG 27 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 18 
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CGTGGTGACA GATTTCATTC TTCTGGG 27 

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

t 

<223> Artificially Synthesized Primer Sequence 
<400> 19 

TCAACCTGCT GTTATCCTCT TCAGGGC 27 

<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 20 

CCTGGTTCCT CAGTGTATAG ATGAGGG 27 



<210> 
<211> 



21 
942 
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<212> DNA 

<213> Homo sapiens 
<400> 21 

atggacagtc taaaccaaac aagagtgact 
cgggtgctgg aaatgctgtt tttcatggca 
gggaacattc tcatcatcat tgccacagtc 
ttcttcctga gcaatctgtc ctttattgac 
atgttggagg gtttgctttt agaaagaaag 
ctcttcttcc tacatctctt tgcctgtgcc 
gatcgttacg tggctatctg cactccactc 
tgtatacagc ttgtctttgc tctctggttg 
ttcttgacta ttcgtctacc ttactgtggc 
gtgcctcttg ttatcaagct ggcctgcaca 
accaatagtg gaaccatctc cctctcctgt 
atcctggttt ctcttcgaaa acactcagct 
tcggcccact tcatggtggt tgccctcttc 
ccagacacca gcttctccat tgacaaggtg 
ttgctgaatc ccttcattta caccttgagg 
ctcaggcaga gacaagtttt tttcacgaaa 

<210> 22 
<211> 942 
<212> DNA 

<213> Homo sapiens 
<400> 22 

atggaaagaa tcaacagcac actgttgact 



gaatttgtct 


tcttgggact 


cactgataac 


60 


ttctcagcca 


tttatatgct 


aacgctttca 


120 


tttactccaa 


gtctccatac 


ccccatgtat 


180 


atctgccact 


catctgtcac 


tgtgcctaag 


240 


accatttcct 


ttgacaactg 


catcacacag 


300 


gagatctttc 


tgctgatcat 


tgtggcgtat 


360 


cactacccca 


atgtgatgaa 


catgagagtc 


420 


gggggtactg 


ttcactcact 


agggcagacc 


480 


cccaacatta 


ttgacagcta 


cttctgtgat 


540 


gatacatacc 


tcacaggaat 


actgattgtg 


600 


ttcttggccg 


tggtcacctc 


ctatatggtc 


660 


gaagggcgcc 


agaaagccct 


gtctacctgc 


720 


tttgggccat 


gtatcttcat 


ctatactcgg 


780 


gtgtctgtct 


tctacacagt 


ggtcacccct 


840 


aatgaggagg 


taaaaagtgc 


catgaagcag 


900 


tcatatacat 


aa 942 







gcgtttatcc tgacaggaat tccgtatcca 60 
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ctcaggctaa 


ggacactctt 


ttttgtgttc 


ggaaacctgc 


ttattttaat 


cactgtctgg 


tacatctttc 


ttggtgttct 


ctcagtcatt 


cgcctcatga 


tgaacttcac 


tttaggtgtc 


caactctatt 


tctatcactt 


cctgggcagc 


tatgacaggt 


acctggcaat 


atgtcagccc 


ctgagcgcct 


tgcttgtggc 


tggagcctgg 


gccatcctaa 


ccttccgcct 


gccctactgt 


gacatccctg 


cagtgttgag 


actggcctgt 


tttgtagaca 


ttggggtggt 


ggttgccagt 


cagatcattc 


aggccatcct 


gagaatccac 


acttgtggag 


cccatgtaac 


cgtggtcacc 


ctgaggcctg 


aaaccaacag 


ccccctggat 


actcctttcc 


tcaaccccct 


tatctacact 


aaaagaatgc 


tcagaagccc 


aagaactccg 



ttttttctaa tctacatcct gactcagctg 120 
gcagacccaa ggctccatgc ccgccccatg 180 
gatatgagca tctcctccat cattgtccct 240 
aaacccatcc catttggtgg ctgtgttgct 300 
acccagtgct tcctctacac cctaatggcc 360 
ctgcgctacc ctgtgctcat gactgctaag 420 
atggcaggat ccatccatgg ggctctccag 480 
gggcccaatc aggtggatta cttcttctgt 540 
gctgacacaa cagtcaacga gctggtgacg 600 
tgcttctccc tgatcctcct ctcctacata 660 
acagctgatg ggcggcgccg ggctttttca 720 
gtgtactatg tgccctgtgc cttcatctac 780 
ggggcagctg ccctagtccc cacggccatc 840 
ctgcggaacc aagaggtgaa gctggccctg 900 
agtgaggttt ga 942 



<210> 23 
<211> 933 
<212> DNA 

<213> Homo sapiens 
<400> 23 

atgggaaaga ccaaaaacac atcgctggat 
ttgtctcacc ccccaaatct aagaagcctc 
ctcactcagc tggggaacct gctcattctg 
gctcgcccca tgtacattct tctgggagtg 
gtcaccgttc ctcggcttat tttggatttt 



gccgtggtga cagatttcat tcttctgggt 60 
ctcttcctgg tcttcttcat catttacatc 120 
ctcaccatgt gggctgaccc gaagctctgt 180 
ctctcattcc tggacatgtg gctctcctca 240 
actccttcca tcaaggctat cccgtttggt 300 



24/66 



ggctgtgtgg 


ctcaactgta 


tttctttcac 


ttcctgggca 


gcacccagtg 


cttcctctac 


360 


accttgatgg 


cctatgacag 


gtacctagca 


atatgtcagc 


ccctgcacta 


cccagtgctc 


420 


atgaatggga 


ggttatgcac 


agtccttgtg 


gctggagctt 


gggtcgccgg 


ctccatgcat 


480 


gggtctatcc 


aggccacctt 


gaccttccgc 


ctgccctact 


gtgggcccaa 


tcaggtggat 


540 


tactttatct 


gtgacatccg 


cgcagtattg 


agactggcct 


gtgctgacac 


aactgtcaat 


600 


gagcttgtga 


cctttgtgga 


cgtcagggta 


gtggccgcca 


gttgcttcat 


gttaattctg 


660 


ctctcctatg 


ccaacatagt 


ccatgccatc 


ctgaagatac 


gcaccgctga 


tgggaggcgc 


720 


cgggccttct 


ccacctgtgg 


ctcccaccta 


atcgtggtca 


cagtctacta 


tgtcccctgt 


780 


attttcatct 


accttagggc 


tggctccaaa 


gaccccctgg 


atggggcagc 


ggctgtgttt 


840 


tacactgttg 


tcactccatt 


actgaacccc 


ctcatctata 


cactgaggaa 


ccaggaagtg 


900 


aagtctgccc 


tgaagaggat 


aacagcaggt 


tga 933 









<210> 24 

<211> 1060 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (17). . (892) 

<400> 24 

aaatgcctaa agaaga atg acc atg gaa aat tat tct atg gca get cag ttt 52 

Met Thr Met Glu Asn Tyr Ser Met Ala Ala Gin Phe 
1 5 10 
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gtc tta gat ggt tta aca cag caa gca gag etc cag ctg ccc etc ttc 

Val Leu Asp Gly Leu Thr Gin Gin Ala Glu Leu Gin Leu Pro Leu Phe 

15 20 25 

etc ctg ttc ctg gga ate tat gtg gtc aca gta gtg ggc aac ctg ggc 

Leu Leu Phe Leu Gly He Tyr Val Val Thr Val Val Gly Asn Leu Gly 
30 35 40 

atg att etc ctg att gca gtc age cct eta ctt cac acc ccc atg tac 

Met He Leu Leu He Ala Val Ser Pro Leu Leu His Thr Pro Met Tyr 

45 50 55 60 

tat ttc etc age age ttg tec ttc gtc gat ttc tgc tat tec tct gtc 

Tyr Phe Leu Ser Ser Leu Ser Phe Val Asp Phe Cys Tyr Ser Ser Val 

65 70 75 

att act ccc aaa atg ctg gtg aac ttc eta gga aag aag aat aca ate 

He Thr Pro Lys Met Leu Val Asn Phe Leu Gly Lys Lys Asn Thr He 

80 85 90 

ctt tac tct gag tgc atg gtc cag etc ttt ttc ttt gtg gtc ttt gtg 

Leu Tyr Ser Glu Cys Met Val Gin Leu Phe Phe Phe Val Val Phe Val 

95 100 105 



gtg get gag ggt tac etc ctg act gec atg gca tat gat cgc tat gtt 
Val Ala Glu Gly Tyr Leu Leu Thr Ala Met Ala Tyr Asp Arg Tyr Val 
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110 115 120 

gcc ate tgt age cca ctg ctt tat aat gcg ate atg tec tea tgg gtc 

Ala lie Cys Ser Pro Leu Leu Tyr Asn Ala lie Met Ser Ser Trp Val 

125 130 135 140 

tgc tea ctg eta gtg ctg get gcc ttc ttc ttg ggc ttt etc tct gcc 

Cys Ser Leu Leu Val Leu Ala Ala Phe Phe Leu Gly Phe Leu Ser Ala 
145 150 155 

ttg act cat aca agt gcc atg atg aaa ctg tec ttt tgc aaa tec cac 

Leu Thr His Thr Ser Ala Met Met Lys Leu Ser Phe Cys Lys Ser His 

160 165 170 

att ate aac cat tac ttc tgt gat gtt ctt ccc etc etc aat etc tec 

He He Asn His Tyr Phe Cys Asp Val Leu Pro Leu Leu Asn Leu Ser 

175 180 - 185 

tgc tec aac aca cac etc aat gag ctt eta ctt ttt ate att gcg ggg 

Cys Ser Asn Thr His Leu Asn Glu Leu Leu Leu Phe lie He Ala Gly 
190 195 200 



ttt aac acc ttg gtg ccc acc eta get gtt get gtc tec tat gcc ttc 
Phe Asn Thr Leu Val Pro Thr Leu Ala Val Ala Val Ser Tyr Ala Phe 
205 210 215 220 
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ate etc tac age ate ctt cac ate cgc tec tea gag ggc egg tec aaa 724 

He Leu Tyr Ser He Leu His He Arg Ser Ser Glu Gly Arg Ser Lys 
225 230 235 

get ttt gga aca tgc age tct cat etc atg get gtg gtg ate ttc ttt 772 

Ala Phe Gly Thr Cys Ser Ser His Leu Met Ala Val Val He Phe Phe 
240 245 250 

ggg tec att ace ttc atg tat ttc aag ccc cct tea agt aac tec ctg 820 

Gly Ser He Thr Phe Met Tyr Phe Lys Pro Pro Ser Ser Asn Ser Leu 
255 260 265 

gac cag gag aag gtg tec tct gtg ttc tac ace acg gtg ate ccc atg 868 

Asp Gin Glu Lys Val Ser Ser Val Phe Tyr Thr Thr Val He Pro Met 

270 275 280 

ctg aac cct tta ata tac agt ctg taatcacagc actttggaag gctgaggcag 922 
Leu Asn Pro Leu He Tyr Ser Leu 
285 290 

ggttgcttga gtccagtttg agaccatcct ggggaacata gtgegatett gtttctttcc 982 

actgectaaa aacttcaatg ctcaatttta ettgeaattt cctcttcctg acatggagaa 1042 



tgttggcttg gaatgttc 



1060 
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<210> 25 

<211> 1069 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (18). . (956) 

<400> 25 

atttttgaag acaaaaa atg ctg get aga aac aac tec tta gtg act gaa 

Met Leu Ala Arg Asn Asn Ser Leu Val Thr Glu 

1 5 10 

ttt att ctt get gga tta aca gat cgt cca gag ttc tgg caa ccc ttc 
Phe He Leu Ala Gly Leu Thr Asp Arg Pro Glu Phe Trp Gin Pro Phe 
15 20 25 

ttt ttc ctg ttc eta gtg ate tac att gtc ace atg gta ggc aac ctt 
Phe Phe Leu Phe Leu Val He Tyr He Val Thr Met Val Gly Asn Leu 
30 35 40 



ggc ttg ate act ctt ttc ggt eta aat tct cac etc cac aca cca atg 
Gly Leu lie Thr Leu Phe Gly Leu Asn Ser His Leu His Thr Pro Met 
45 50 55 
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tac tat ttc etc ttc aat etc tec ttc att gat etc tgt tac tec tct 242 
Tyr Tyr Phe Leu Phe Asn Leu Ser Phe He Asp Leu Cys Tyr Ser Ser 
60 65 70 75 

gtt ttc act ccc aaa atg eta atg aac ttt gtg tea aaa aag aat att 290 
Val Phe Thr Pro Lys Met Leu Met Asn Phe Val Ser Lys Lys Asn He 
80 85 90 

ate tec aat gtt ggg tgc atg act egg ctg ttt ttc ttt etc ttt ttc 338 
He Ser Asn Val Gly Cys Met Thr Arg Leu Phe Phe Phe Leu Phe Phe 
95 100 105 

gtc ate tct gaa tgt tac atg ttg acc tea atg gca tat gat cgc tat 386 
Val He Ser Glu Cys Tyr Met Leu Thr Ser Met Ala Tyr Asp Arg Tyr 
110 115 120 

gtg gec ate tgt aat cca ttg ctg tat aag gtc acc atg tec cat cag 434 
Val Ala He Cys Asn Pro Leu Leu Tyr Lys Val Thr Met Ser His Gin 
125 130 135 

gtc tgt tct atg etc act ttt get get tac ata atg gga ttg get gga 482 
Val Cys Ser Met Leu Thr Phe Ala Ala Tyr He Met Gly Leu Ala Gly 
140 145 150 155 



gec acg gec cac acc ggg tgc atg ttt aga etc acc ttc tgc agt get 
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Ala Thr Ala His Thr Gly Cys Met Phe Arg Leu Thr Phe Cys Ser Ala 
160 165 170 

aat ate att aac cat tac ttg tgt gac ata etc ccc etc etc cag ctt 578 
Asn He He Asn His Tyr Leu Cys Asp lie Leu Pro Leu Leu Gin Leu 
175 180 185 

tec tgc ace age acc tat gtc aac gag gtg gtt gtt etc att gtt gtg 626 
» 

Ser Cys Thr Ser Thr Tyr Val Asn Glu Val Val Val Leu He Val Val 
190 195 200 

ggt act aat ate acg gta ccc agt tgt acc ate etc att tct tat gtt 674 
Gly Thr Asn He Thr Val Pro Ser Cys Thr He Leu He Ser Tyr Val 
205 210 215 

ttc att gtc act age att ctt cat ate aaa tec act caa gga aga tea 722 
Phe He Val Thr Ser He Leu His He Lys Ser Thr Gin Gly Arg Ser 
220 225 230 235 

aaa gec ttc agt act tgt age tct cat gtc att get ctg tct ctg ttt 770 
Lys Ala Phe Ser Thr Cys Ser Ser His- Val He Ala Leu Ser Leu Phe 
240 245 250 



ttt ggg tea gcg gca ttc atg tat att aaa tat tct tct gga tct atg 818 
Phe Gly Ser Ala Ala Phe Met Tyr He Lys Tyr Ser Ser Gly Ser Met 
255 260 265 
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gag cag gga aaa gtt ttt tct gtt ttc tac act aat gtg gtg ccc atg 866 
Glu Gin Gly Lys Val Phe Ser Val Phe Tyr Thr Asn Val Val Pro Met 
270 275 280 

etc aat ccc etc ate tac agt ttg agg aac aag gat gtc aaa gtt gca 914 
Leu Asn Pro Leu He Tyr Ser Leu Arg Asn Lys Asp Val Lys Val Ala 
285 290 295 

» 

ctg agg aaa get ctg att aaa att cag agg aga aat ata ttc 956 
Leu Arg Lys Ala Leu He Lys He Gin Arg Arg Asn He Phe 
300 305 310 

taattagaag cagtaatgat gtaaaacaat tgaaggactt caaattttta ttagtgtttt 1016 
tcatgaagag attttgttgt ttctacagat ggtgttatgt gtgatttaat aaa 1069 

<210> 26 
<211> 1069 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (18). . (956) 
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<400> 26 

atttttgaag acaaaaa atg ctg get aga aac aac tec tta gtg act gaa 

Met Leu Ala Arg Asn Asn Ser Leu Val Thr Glu 
*1 5 10 

ttt att ctt get gga tta aca gat cgt cca gag ttc egg caa ccc etc 
Phe He Leu Ala Gly Leu Thr Asp Arg Pro Glu Phe Arg Gin Pro Leu 
15 20 25 

ttt ttc ctg ttt eta gtg ate tac att gtc ace atg gta ggc aac ctt 
Phe Phe Leu Phe Leu Val He Tyr He Val Thr Met Val Gly Asn Leu 
30 35 40 

ggc ttg ate att ctt ttc ggt eta aat tct cac etc cac aca cca atg 
Gly Leu He He Leu Phe Gly Leu Asn Ser His Leu His Thr Pro Met 
45 50 55 

tac tat ttc etc ttc aat etc tec ttc att gat etc tgt tac tec tct 
Tyr Tyr Phe Leu Phe Asn Leu Ser Phe lie Asp Leu Cys Tyr Ser Ser 
60 65 70 75 



gtt ttc act ccc aaa atg eta atg aac ttt gta tea aaa aag aat att 
Val Phe Thr Pro Lys Met Leu Met Asn Phe Val Ser Lys Lys Asn lie 
80 85 90 
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ate tec tat gtt ggg tgc atg act cag ctg ttt ttc ttt etc ttt ttt 338 
He Ser Tyr Val Gly Cys Met Thr Gin Leu Phe Phe Phe Leu Phe Phe 
95 100 105 

gtc ate tct gaa tgc tac ata ttg acc tea atg gca tat gat cgc tat 386 
Val lie Ser Glu Cys Tyr He Leu Thr Ser Met Ala Tyr Asp Arg Tyr 
110 115 120 

gtg gec ate tgt aat cca ttg ctg tat aag gtc acc atg tec cat cag 434 
Val Ala He Cys Asn Pro Leu Leu Tyr Lys Val Thr Met Ser His Gin 
125 130 135 

gtc tgt tct atg etc act ttt get get tac ata atg gga ttg get gga 482 
Val Cys Ser Met Leu Thr Phe Ala Ala Tyr He Met Gly Leu Ala Gly 
140 145 150 155 

gec acg gee cac acc ggg tgc atg ctt aga etc acc ttc tgc agt get 530 
Ala Thr Ala His Thr Gly Cys Met Leu Arg Leu Thr Phe Cys Ser Ala 
160 165 170 

aat ate ate aac cat tac ttg tgt gac ata etc ccc etc etc cag ctt 578 
Asn He He Asn His Tyr Leu Cys Asp He Leu Pro Leu Leu Gin Leu 
175 180 185 

tec tgc acc age acc tat gtc aac gag gtg gtt gtt etc att gtt gtg 626 
Ser Cys Thr Ser Thr Tyr Val Asn Glu Val Val Val Leu He Val Val 
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190 195 200 

ggt att aat ate atg gta ccc agt tgt acc ate etc att tct tat gtt 674 
Gly He Asn He Met Val Pro Ser Cys Thr He Leu He Ser Tyr Val 
205 210 215 

ttc att gtc act age att ctt cat ate aaa tec act caa gga aga tea 722 
Phe He Val Thr Ser He Leu His He Lys Ser Thr Gin Gly Arg Ser 
220 225 230 235 

aaa gec ttc agt act tgt age tct cat gtc att get ctg tct ctg ttt 770 
Lys Ala Phe Ser Thr Cys Ser Ser His Val He Ala Leu Ser Leu Phe 
240 245 250 

ttt ggg tea gcg gca ttc atg tat att aaa tat tct tct gga tct atg 818 
Phe Gly Ser Ala Ala Phe Met Tyr He Lys Tyr Ser Ser Gly Ser Met 
255 260 265 

gag cag gga aaa gtt tct tct gtt ttc tac act aat gtg gtg ccc atg 866 
Glu Gin Gly Lys Val Ser Ser Val Phe Tyr Thr Asn Val Val Pro Met 
270 275 280 

etc aat cct etc ate tac agt ttg agg aac aag gat gtc aaa gtt gca 914 
Leu Asn Pro Leu lie Tyr Ser Leu Arg Asn Lys Asp Val Lys Val Ala 
285 290 295 
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ctg agg aaa get ctg att aaa att cag aga aga aat ata ttc 



956 



Leu Arg Lys Ala Leu lie Lys lie Gin Arg Arg Asn lie Phe 



300 



305 



310 



taattagaag cagtaataat gtaaaacgat tgaagaactt taaattttta ttagtgtgtt 1016 



ccatgaagag attttgttgt ttctacagat ggtgttatgt gtgatttaat aaa 



1069 



<210> 27 
<211> 976 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (19).. (945) 

<400> 27 

acagctcgcc aagagaga atg act ctg aga aac age tec tea gtg act gag 51 

Met Thr Leu Arg Asn Ser Ser Ser Val Thr Glu 



ttt ate ctt gtg gga tta tea gaa cag cca gag etc cag etc cct ctt 99 
Phe lie Leu Val Gly Leu Ser Glu Gin Pro Glu Leu Gin Leu Pro Leu 



5 



10 



15 



20 



25 
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ttc ctt eta ttc tta ggg ate tat gtg ttc act gtg gtg ggc aac ttg 
Phe Leu Leu Phe Leu Gly He Tyr Val Phe Thr Val Val Gly Asn Leu 
30 35 40 

ggc ttg ate acc tta att ggg ata aat cct age ctt cac ace ccc atg 
Gly Leu He Thr Leu He Gly He Asn Pro Ser Leu His Thr Pro Met 
45 50 55 

tac ttt ttc etc ttc aac ttg tec ttt ata gat etc tgt tat tec tgt 
Tyr Phe Phe Leu Phe Asn Leu Ser Phe He Asp Leu Cys Tyr Ser Cys 
60 65 70 75 



gtg ttt acc ccc aaa atg ctg aat gac ttt gtt tea gaa agt ate ate 
Val Phe Thr Pro Lys Met Leu Asn Asp Phe Val Ser Glu Ser He He 
80 85 90 



tct tat gtg gga tgt atg act cag eta ttt ttc ttc tgt ttc ttt gtc 
Ser Tyr Val Gly Cys Met Thr Gin Leu Phe Phe Phe Cys Phe Phe Val 
95 100 105 



aat tct gag tgc tat gtg ttg gta tea atg gee tat gat cgc tat gtg 
Asn Ser Glu Cys Tyr Val Leu Val Ser Met Ala Tyr Asp Arg Tyr Val 
110 115 120 



gec ate tgc aac ccc ctg etc tac atg gtc acc atg tec cca agg gtc 
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Ala He Cys Asn Pro Leu Leu Tyr Met Val Thr Met Ser Pro Arg Val 
125 130 135 

tgc ttt ctg ctg atg ttt ggt tec tat gtg gta ggg ttt get ggg gec 483 
Cys Phe Leu Leu Met Phe Gly Ser Tyr Val Val Gly Phe Ala Gly Ala 
140 145 150 155 

atg gec cac act gga age atg ctg cga ctg acc ttc tgt gat tec aac 531 
Met Ala His Thr Gly Ser Met Leu Arg Leu Thr Phe Cys Asp Ser Asn 
160 165 170 

gtc att gac cat tat ctg tgt gac gtt etc ccc etc ttg cag etc tec 579 
Val lie Asp His Tyr Leu Cys Asp Val Leu Pro Leu Leu Gin Leu Ser 
175 180 185 

tgc acc age acc cat gtc agt gag ctg gta ttt ttc att gtt gtt gga 627 
Cys Thr Ser Thr His Val Ser Glu Leu Val Phe Phe He Val Val Gly 
190 195 200 

gta ate acc atg eta tec age ata age ate gtc ate tct tac get ttg 675 
Val He Thr Met Leu Ser Ser He Ser He Val He Ser Tyr Ala Leu 
205 210 215 



ata etc tec aac ate etc tgt 
He Leu Ser Asn He Leu Cys 
220 225 



att cct tct gca gag ggc aga tec aaa 
He Pro Ser Ala Glu Gly Arg Ser Lys 
230 235 



723 
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gcc ttt age aca tgg ggc tec cac ata att get gtt get ctg ttt ttt 

Ala Phe Ser Thr Trp Gly Ser His He He Ala Val Ala Leu Phe Phe 
240 245 250 

ggg tea ggg aca ttc acc tac tta aca aca tct ttt cct ggc tct atg 

Gly Ser Gly Thr Phe Thr Tyr Leu Thr Thr Ser Phe Pro Gly Ser Met 

255 260 265 

aac cat ggc aga ttt gcc tea gtc ttt tac acc aat gtg gtt ccc atg 

Asn His Gly Arg Phe Ala Ser Val Phe Tyr Thr Asn Val Val Pro Met 

270 275 280 

ctt aac cct teg ate tac agt ttg agg aat aag gat gat aaa ctt gcc 

Leu Asn Pro Ser He Tyr Ser Leu Arg Asn Lys Asp Asp Lys Leu Ala 
285 290 295 

ctg ggc aaa acc ctg aag aga gtg etc ttc taatgggtct cttcatatca 

Leu Gly Lys Thr Leu Lys Arg Val Leu Phe 

300 305 



ctggcaaccg a 



<210> 28 
<211> 292 
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<212> PRT 

<213> Homo sapiens 
<400> 28 

Met Thr Met Glu Asn Tyr Ser Met Ala Ala Gin Phe Val Leu Asp Gly 
15 10 15 

Leu Thr Gin Gin Ala Glu Leu Gin Leu Pro Leu Phe Leu Leu Phe Leu 
20 25 30 

Gly He Tyr Val Val Thr Val Val Gly Asn Leu Gly Met He Leu Leu 
35 40 45 

He Ala Val Ser Pro Leu Leu His Thr Pro Met Tyr Tyr Phe Leu Ser 
50 55 60 

Ser Leu Ser Phe Val Asp Phe Cys Tyr Ser Ser Val He Thr Pro Lys 
65 70 75 80 

Met Leu Val Asn Phe Leu Gly Lys Lys Asn Thr lie Leu Tyr Ser Glu 
85 90 95 

Cys Met Val Gin Leu Phe Phe Phe Val Val Phe Val Val Ala Glu Gly 
100 105 110 



Tyr Leu Leu Thr Ala Met Ala Tyr Asp Arg Tyr Val Ala He Cys Ser 
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115 120 125 

Pro Leu Leu Tyr Asn Ala lie Met Ser Ser Trp Val Cys Ser Leu Leu 
130 135 140 

Val Leu Ala Ala Phe Phe Leu Gly Phe Leu Ser Ala Leu Thr His Thr 
145 150 155 160 

Ser Ala Met Met Lys Leu Ser Phe Cys Lys Ser His lie lie Asn His 
165 170 175 

Tyr Phe Cys Asp Val Leu Pro Leu Leu Asn Leu Ser Cys Ser Asn Thr 
180 185 190 

His Leu Asn Glu Leu Leu Leu Phe lie lie Ala Gly Phe Asn Thr Leu 
195 200 205 

Val Pro Thr Leu Ala Val Ala Val Ser Tyr Ala Phe He Leu Tyr Ser 
210 215 220 

He Leu His He Arg Ser Ser Glu Gly Arg Ser Lys Ala Phe Gly Thr 
225 230 235 240 



Cys Ser Ser His Leu Met Ala Val Val lie Phe Phe Gly Ser He Thr 
245 250 255 
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Phe Met Tyr Phe Lys Pro Pro Ser Ser Asn Ser Leu Asp Gin Glu Lys 
260 265 270 



Val Ser Ser Val Phe Tyr Thr Thr Val He Pro Met Leu Asn Pro Leu 
275 280 285 



He Tyr Ser Leu 
290 



<210> 29 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Met Leu Ala Arg Asn Asn Ser Leu Val Thr Glu Phe He Leu Ala Gly 
15 10 15 

Leu Thr Asp Arg Pro Glu Phe Trp Gin Pro Phe Phe Phe Leu Phe Leu 
20 25 30 

Val lie Tyr lie Val Thr Met Val Gly Asn Leu Gly Leu He Thr Leu 
35 40 45 

Phe Gly Leu Asn Ser His Leu His Thr Pro Met Tyr Tyr Phe Leu Phe 
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50 55 60 

Asn Leu Ser Phe He Asp Leu Cys Tyr Ser Ser Val Phe Thr Pro Lys 
65 70 75 80 

Met Leu Met Asn Phe Val Ser Lys Lys Asn lie He Ser Asn Val Gly 
85 90 95 

Cys Met Thr Arg Leu Phe Phe Phe Leu Phe Phe Val He Ser Glu Cys 
100 105 110 

Tyr Met Leu Thr Ser Met Ala Tyr Asp Arg Tyr Val Ala He Cys Asn 
115 120 125 

Pro Leu Leu Tyr Lys Val Thr Met Ser His Gin Val Cys Ser Met Leu 
130 135 140 

Thr Phe Ala Ala Tyr He Met Gly Leu Ala Gly Ala Thr Ala His Thr 
145 150 155 160 

Gly Cys Met Phe Arg Leu Thr Phe Cys Ser Ala Asn He He Asn His 
165 170 175 



Tyr Leu Cys Asp lie Leu Pro Leu Leu Gin Leu Ser Cys Thr Ser Thr 
180 185 190 
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Tyr Val Asn Glu Val Val Val Leu He Val Val Gly Thr Asn He Thr 
195 200 205 

Val Pro Ser Cys Thr He Leu He Ser Tyr Val Phe He Val Thr Ser 
210 215 220 

He Leu His He Lys Ser Thr Gin Gly Arg Ser Lys Ala Phe Ser Thr 
225 230 235 240 

Cys Ser Ser His Val He Ala Leu Ser Leu Phe Phe Gly Ser Ala Ala 
245 250 255 

Phe Met Tyr He Lys Tyr Ser Ser Gly Ser Met Glu Gin Gly Lys Val 
260 265 270 

Phe Ser Val Phe Tyr Thr Asn Val Val Pro Met Leu Asn Pro Leu He 
275 280 ■ 285 

Tyr Ser Leu Arg Asn Lys Asp Val Lys Val Ala Leu Arg Lys Ala Leu 
290 295 300 

He Lys He Gin Arg Arg Asn He Phe 
305 310 



<210> 30 
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<211> 313 
<212> PRT 

<213> Homo sapiens 
<400> 30 

Met Leu Ala Arg Asn Asn Ser Leu Val Thr Glu Phe He Leu Ala Gly 
15 10 15 

Leu Thr Asp Arg Pro Glu Phe Arg Gin Pro Leu Phe Phe Leu Phe Leu 
20 25 30 

Val He Tyr He Val Thr Met Val Gly Asn Leu Gly Leu He He Leu 
35 40 45 

Phe Gly Leu Asn Ser His Leu His Thr Pro Met Tyr Tyr Phe Leu Phe 
50 55 60 

Asn Leu Ser Phe lie Asp Leu Cys Tyr Ser Ser Val Phe Thr Pro Lys 
65 70 75 80 

Met Leu Met Asn Phe Val Ser Lys Lys Asn He He Ser Tyr Val Gly 
85 90 95 



Cys Met Thr Gin Leu Phe Phe Phe Leu Phe Phe Val He Ser Glu Cys 
100 105 110 
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Tyr lie Leu Thr Ser Met Ala Tyr Asp Arg Tyr Val Ala He Cys Asn 
115 120 125 

Pro Leu Leu Tyr Lys Val Thr Met Ser His Gin Val Cys Ser Met Leu 
130 * 135 140 

Thr Phe Ala Ala Tyr He Met Gly Leu Ala Gly Ala Thr Ala His Thr 
145 150 155 160 

Gly Cys Met Leu Arg Leu Thr Phe Cys Ser Ala Asn He lie Asn His 
165 170 175 

Tyr Leu Cys Asp He Leu Pro Leu Leu Gin Leu Ser Cys Thr Ser Thr 
180 185 190 

Tyr Val Asn Glu Val Val Val Leu He Val Val Gly He Asn He Met 
195 200 205 

Val Pro Ser Cys Thr He Leu He Ser Tyr Val Phe He Val Thr Ser 
210 215 220 

He Leu His He Lys Ser Thr Gin Gly Arg Ser Lys Ala Phe Ser Thr 
225 . 230 235 240 



Cys Ser Ser His Val lie Ala Leu Ser Leu Phe Phe Gly Ser Ala Ala 
245 250 255 



46/66 



Phe Met Tyr lie Lys Tyr Ser Ser Gly Ser Met Glu Gin Gly Lys Val 
260 265 270 

Ser Ser Val Phe Tyr Thr Asn Val Val Pro Met Leu Asn Pro Leu lie 
275 280 285 

Tyr Ser Leu Arg Asn Lys Asp Val Lys Val Ala Leu Arg Lys Ala Leu 
290 295 300 

He Lys He Gin Arg Arg Asn He Phe 
305 310 



<210> 31 
<211> 309 
<212> PRT 

<213> Homo sapiens 
<400> 31 

Met Thr Leu Arg Asn Ser Ser Ser Val Thr Glu Phe He Leu Val Gly 
15 10 15 

Leu Ser Glu Gin Pro Glu Leu Gin Leu Pro Leu Phe Leu Leu Phe Leu 
20 25 30 
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Gly He Tyr Val Phe Thr Val Val Gly Asn Leu Gly Leu lie Thr Leu 
35 40 45 

He Gly He Asn Pro Ser Leu His Thr Pro Met Tyr Phe Phe Leu Phe 
50 55 60 

Asn Leu Ser Phe He Asp Leu Cys Tyr Ser Cys Val Phe Thr Pro Lys 
65 70 75 80 

Met Leu Asn Asp Phe Val Ser Glu Ser He He Ser Tyr Val Gly Cys 
85 90 95 

Met Thr Gin Leu Phe Phe Phe Cys Phe Phe Val Asn Ser Glu Cys Tyr 
100 105 110 

Val Leu Val Ser Met Ala Tyr Asp Arg Tyr Val Ala He Cys Asn Pro 
115 120 - 125 

Leu Leu Tyr Met Val Thr Met Ser Pro Arg Val Cys Phe Leu Leu Met 
130 135 140 

Phe Gly Ser Tyr Val Val Gly Phe Ala Gly Ala Met Ala His Thr Gly 
145 150 155 160 



Ser Met Leu Arg Leu Thr Phe Cys Asp Ser Asn Val lie Asp His Tyr 
165 170 175 
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Leu Cys Asp Val Leu Pro Leu Leu Gin Leu Ser Cys Thr Ser Thr His 
180 185 190 

Val Ser Glu Leu Val Phe Phe lie Val Val Gly Val He Thr Met Leu 
195 200 205 

Ser Ser He Ser He Val He Ser Tyr Ala Leu He Leu Ser Asn He 
' 210 215 220 

Leu Cys He Pro Ser Ala Glu Gly Arg Ser Lys Ala Phe Ser Thr Trp 
225 230 235 240 

Gly Ser His He He Ala Val Ala Leu Phe Phe Gly Ser Gly Thr Phe 
245 250 255 

Thr Tyr Leu Thr Thr Ser Phe Pro Gly Ser Met Asn His Gly Arg Phe 
260 265 270 

Ala Ser Val Phe Tyr Thr Asn Val Val Pro Met Leu Asn Pro Ser He 
275 280 285 

Tyr Ser Leu Arg Asn Lys Asp Asp Lys Leu Ala Leu Gly Lys Thr Leu 
290 295 300 



Lys Arg Val Leu Phe 
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305 

<210> 32 
<211> 762 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (17). . (760) 

<400> 32 

aaatgcctaa agaaga atg acc atg gaa aat tat tct atg gca get cag ttt 52 
Met Thr Met Glu Asn Tyr Ser Met Ala Ala Gin Phe 
1 5 10 

gtc tta gat ggt tta aca cag caa gca gag etc cag ctg ccc etc ttc 100 
Val Leu Asp Gly Leu Thr Gin Gin Ala Glu Leu Gin Leu Pro Leu Phe 
15 20 25 

etc ctg ttc ctg gga ate tat gtg gtc aca gta gtg ggc aac ctg ggc 148 
Leu Leu Phe Leu Gly lie Tyr Val Val Thr Val Val Gly Asn Leu Gly 
30 35 40 



atg att etc ctg att gca gtc age cct eta ctt cac acc ccc atg tac 196 



50/66 



Met lie Leu 
45 

tat ttc etc 

Tyr Phe Leu 

att act ccc 
He Thr Pro 

ctt tac tct 
Leu Tyr Ser 
95 

gtg get gag 
Val Ala Glu 
110 

gec ate tgt 
Ala He Cys 
125 

tgc tea ctg 
Cys Ser Leu 



Leu He Ala Val 
50 

age age ttg tec 
Ser Ser Leu Ser 
65 

aaa atg ctg gtg 
Lys Met Leu Val 
80 

gag tgc atg gtc 
Glu Cys Met Val 

ggt tac etc ctg 
Gly Tyr Leu Leu 
115 

age cca ctg ctt 
Ser Pro Leu Leu 
130 

eta gtg ctg get 
Leu Val Leu Ala 
145 



Ser Pro Leu 

ttc gtc gat 
Phe Val Asp 
70 

aac ttc eta 
Asn Phe Leu 
85 

cag etc ttt 
Gin Leu Phe 
100 

act gec atg 
Thr Ala Met 

tat aat gcg 
Tyr Asn Ala 

gec ttc ttc 
Ala Phe Phe 
150 



Leu His Thr Pro 
55 

ttc tgc tat tec 
Phe Cys Tyr Ser 

gga aag aag aat 
Gly Lys Lys Asn 
90 

ttc ttt gtg gtc 
Phe Phe Val Val 
105 

gca tat gat cgc 
Ala Tyr Asp Arg 
120 

ate atg tec tea 
He Met Ser Ser 
135 

ttg ggc ttt etc 
Leu Gly Phe Leu 



Met Tyr 
60 

tct gtc 244 
Ser Val 
75 

aca ate 292 
Thr He 

ttt gtg 340 
Phe Val 

tat gtt 388 
Tyr Val 

tgg gtc 436 
Trp Val 
140 

tct gec 484 

Ser Ala 

155 
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ttg act cat aca agt gcc atg atg aaa ctg tec ttt tgc aaa tec cac 532 

Leu Thr His Thr Ser Ala Met Met Lys Leu Ser Phe Cys Lys Ser His 

160 165 170 

att ate aac cat tac ttc tgt gat gtt ctt ccc etc etc aat etc tec 580 

lie lie Asn His Tyr Phe Cys Asp Val Leu Pro Leu Leu Asn Leu Ser 
175 180 185 

» 

tgc tec aac aca cac etc aat gag ctt eta ctt ttt ate att gcg ggg 628 

Cys Ser Asn Thr His Leu Asn Glu Leu Leu Leu Phe lie lie Ala Gly 
190 195 200 

ttt aac ace ttg gtg ccc ace eta get gtt get gtc tec tat gcc ttc 676 

Phe Asn Thr Leu Val Pro Thr Leu Ala Val Ala Val Ser Tyr Ala Phe 

205 210 215 220 

ate etc tac age ate ctt cac ate cgc tec tea gag ggc egg tec aaa 724 

lie Leu Tyr Ser He Leu His lie Arg Ser Ser Glu Gly Arg Ser Lys 
225 230 235 



get ttt gga aca tgc age tct cat etc atg get gtg gt 
Ala Phe Gly Thr Cys Ser Ser His Leu Met Ala Val 
240 245 



762 
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<210> 33 
<211> 248 
<212> PRT 

<213> Homo sapiens 
<400> 33 

Met Thr Met Glu Asn Tyr Ser Met Ala Ala Gin Phe Val Leu Asp Gly 
15 10 15 

> 

Leu Thr Gin Gin Ala Glu Leu Gin Leu Pro Leu Phe Leu Leu Phe Leu 
20 25 30 

Gly He Tyr Val Val Thr Val Val Gly Asn Leu Gly Met He Leu Leu 
35 40 45 

He Ala Val Ser Pro Leu Leu His Thr Pro Met Tyr Tyr Phe Leu Ser 
50 55 60 

Ser Leu Ser Phe Val Asp Phe Cys Tyr Ser Ser Val He Thr Pro Lys 
65 70 75 80 

Met Leu Val Asn Phe Leu Gly Lys Lys Asn Thr He Leu Tyr Ser Glu 
85 90 95 



Cys Met Val Gin Leu Phe Phe Phe Val Val Phe Val Val Ala Glu Gly 
100 105 110 
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Tyr Leu Leu Thr Ala Met Ala Tyr Asp Arg Tyr Val Ala lie Cys Ser 
115 120 125 

Pro Leu Leu Tyr Asn Ala lie Met Ser Ser Trp Val Cys Ser Leu Leu 
130 135 140 

Val Leu Ala Ala Phe Phe Leu Gly Phe Leu Ser Ala Leu Thr His Thr 
» 

145 150 155 160 

Ser Ala Met Met Lys Leu Ser Phe Cys Lys Ser His lie lie Asn His 
165 170 175 

Tyr Phe Cys Asp Val Leu Pro Leu Leu Asn Leu Ser Cys Ser Asn Thr 
180 185 190 

His Leu Asn Glu Leu Leu Leu Phe He He Ala Gly Phe Asn Thr Leu 
195 200 205 

Val Pro Thr Leu Ala Val Ala Val Ser Tyr Ala Phe He Leu Tyr Ser 
210 215 220 

He Leu His He Arg Ser Ser Glu Gly Arg Ser Lys Ala Phe Gly Thr 
225 230 235 240 



Cys Ser Ser His Leu Met Ala Val 
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245 

<210> 34 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthsized primer sequence 

<400> 34 

gaagagcagt gagggtccat gttaagg 27 

<210> 35 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 35 
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cagcagcttg tccttcgtcg atttctgc 28 

<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 

<400> 36 

gctagggtgg gcaccaaggt gttaaaccc 29 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 37 
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tgcaaaagga cagtttcatc atggcac 27 

<210> 38 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 

<400> 38 

caaagaactc acccaaattc ctacagct 28 

<210> 39 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 39 
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catggtaggc aaccttggct tgatcac 27 

<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 

<400> 40 

gtttattaaa tcacacataa caccatctg 29 

<210> 41 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 41 
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cagagacaga gcaatgacat gagagctac 29 

<210> 42 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence-' Artificially 
synthesized primer sequence 

<400> 42 

caaagaactc acccaaattc ctacagcc 28 

<210> 43 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 43 
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catggtaggc aaccttggct tgatcat 27 

<210> 44 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence-" Artificially 
synthesized primer sequence 

<400> 44 

gtttattaaa tcacacataa caccatctg 29 

<210> 45 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 45 



cagagacaga gcaatgacat gagagctac 29 

<210> 46 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 

<400> 46 

ccagacagct cgccaagaga gaatgac 27 

<210> 47 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence 



<400> 47 
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cctttataga tctctgttat tcctgtgtg 

<210> 48 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
synthesized primer sequence 

<400> 48 

tcggttgcca gtgatatgaa gagaccc 

<210> 49 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
synthesized primer sequence 



<400> 49 
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ggctttggat ctgccctctg cagaagg 27 

<210> 50 
<211> 450 
<212> DNA 

<213> Homo sapiens 
<400> 50 

cagcagcttg tccttcgtcg atttctgcta ttcctctgtc attactccca aaatgctggt 60 
gaacttccta ggaaagaaga atacaatcct ttactctgag tgcatggtcc agctcttttt 120 
ctttgtggtc tttgtggtgg ctgagggtta cctcctgact gccatggcat atgatcgcta 180 
tgttgccatc tgtagcccac tgctttataa tgcgatcatg tcctcatggg tctgctcact 240 
gctagtgctg gctgccttct tcttgggctt tctctctgcc ttgactcata caagtgccat 300 
gatgaaactg tccttttgca aatcccacat tatcaaccat tacttctgtg atgttcttcc 360 
cctcctcaat ctctcctgct ccaacacaca cctcaatgag cttctacttt ttatcattgc 420 
ggggtttaac accttggtgc ccaccctagc 450 



<210> 51 
<211> 637 
<212> DNA 

<213> Homo sapiens 
<400> 51 

catggtaggc aaccttggct tgatcactct 
aatgtactat ttcctcttca atctctcctt 
tcccaaaatg ctaatgaact ttgtgtcaaa 
gactcggctg tttttctttc tctttttcgt 
ggcatatgat cgctatgtgg ccatctgtaa 
tcaggtctgt tctatgctca cttttgctgc 
ccacaccggg tgcatgttta gactcacctt 
gtgtgacata ctccccctcc tccagctttc 
tgttctcatt gttgtgggta ctaatatcac 
tgttttcatt gtcactagca ttcttcatat 
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tttcggtcta aattctcacc tccacacacc 60 



cattgatctc tgttactcct ctgttttcac 120 



aaagaatatt atctccaatg ttgggtgcat 180 



catctctgaa tgttacatgt tgacctcaat 240 



tccattgctg tataaggtca ccatgtccca 300 



ttacataatg ggattggctg gagccacggc 360 



ctgcagtgct aatatcatta accattactt 420 



ctgcaccagc acctatgtca acgaggtggt 480 



ggtacccagt tgtaccatcc tcatttctta 540 



caaatccact caaggaagat caaaagcctt 600 
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cagtacttgt agctctcatg tcattgctct gtctctg 637 

<210> 52 
<211> 637 
<212> DNA 

<213> Homo sapiens 

<400> 52 

catggtaggc aaccttggct tgatcattct tttcggtcta aattctcacc tccacacacc 60 
aatgtactat ttcctcttca atctctcctt cattgatctc tgttactcct ctgttttcac 120 
tcccaaaatg ctaatgaact ttgtatcaaa aaagaatatt atctcctatg ttgggtgcat 180 
gactcagctg tttttctttc tcttttttgt catctctgaa tgctacatat tgacctcaat 240 
ggcatatgat cgctatgtgg ccatctgtaa tccattgctg tataaggtca ccatgtccca 300 
tcaggtctgt tctatgctca cttttgctgc ttacataatg ggattggctg gagccacggc 360 
ccacaccggg tgcatgctta gactcacctt ctgcagtgct aatatcatca accattactt 420 
gtgtgacata ctccccctcc tccagctttc ctgcaccagc acctatgtca acgaggtggt 480 
tgttctcatt gttgtgggta ttaatatcat ggtacccagt tgtaccatcc tcatttctta 540 
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tgttttcatt gtcactagca ttcttcatat caaatccact caaggaagat caaaagcctt 600 
cagtacttgt agctctcatg tcattgctct gtctctg 637 

<210> 53 
<211> 509 
<212> DNA 

<213> Homo sapiens 
<400> 53 

cctttataga tctctgttat tcctgtgtgt ttacccccaa aatgctgaat gactttgttt 60 
cagaaagtat catctcttat gtgggatgta tgactcagct atttttcttc tgtttctttg 120 
tcaattctga gtgctatgtg ttggtatcaa tggcctatga tcgctatgtg gccatctgca 180 
accccctgct ctacatggtc accatgtccc caagggtctg ctttctgctg atgtttggtt 240 
cctatgtggt agggtttgct ggggccatgg cccacactgg aagcatgctg cgactgacct 300 
tctgtgattc caacgtcatt gaccattatc tgtgtgacgt tctccccctc ttgcagctct 360 
cctgcaccag cacccatgtc agtgagctgg tatttttcat tgttgttgga gtaatcacca 420 
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tgctatccag cataagcatc gtcatctctt acgctttgat actctccaac atcctctgta 480 



ttccttctgc agagggcaga tccaaagcc 509 



